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and methods for identification of inhibitors thereof 

(57) The present invention relates to the identifica- 
tion, isolation and purification of the catalytic domain of 
the human effector checkpoint protein kinase (hChkl). 
A 1 .70 crystal structure of the hChkl kinase domain in 
the active conformation is reported herein. The kinase 
domain of hChkl and its associated crystal structure is 
described for use in the discovery, identification and 
characterization of inhibitors of hChkl. This structure 
provides a three-dimensional description of the binding 
site of the hChkl for structure-based design of small 
molecule inhibitors thereof as therapeutic agents. Inhib- 
itors of hChkl find utility in the treatment of hyperprolif- 
erative disorders such as HIV and cancer. 
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Description 

[0001] This application claims priority from co-pending United States Provisional Application Serial Number 
60/162,887, filed November 1 , 1 999, the contents of which are incorporated by reference herein in their entirety. 

5 

FIELD OF THE INVENTION 

[0002] The present invention generally relates to cell cycle checkpoint kinases which are essential to cellular DNA 
damage responses and coordinating cell cycle arrest. The checkpoint kinases play a role in the surveillance and 

w response to DNA damage. The damage may result from external or internal forces. Such forces include but are not lim- 
ited to errors in replication, DNA base damage, DNA strand breaks, or exposure to radiation or cytotoxic chemicals. 
These checkpoint kinases are integral in the regulatory pathways leading to cell cycle arrest and apoptosis following 
DNA damage, giving the cell notice and time to correct lesions prior to the initiation of replication and chromosome sep- 
aration. The present invention more specifically relates to the isolation and purification of the catalytic domain of the 

is human effector checkpoint protein kinase (hChkl) and its use in the discovery, identification and characterization of 
inhibitors of same. 

BACKGROUND 

20 [0003] Cell growth, division and death is essential to the life cycle of multi-celled organisms. These processes and 
their regulation are strikingly similar across all eukaryotic species. Somatic cell division consists of two sequential proc- 
esses: DNA replication followed by chromosomal separation. The cell spends most of its time preparing for these 
events in a growth cycle (interphase) which in turn consists of three subphases: initial gap (G^, synthesis (S), and sec- 
ondary gap (G 2 ). In G 1( the cell, whose biosynthetic pathways were slowed during mitosis, resumes a high rate of bio- 

25 synthesis. The S phase begins when DNA synthesis starts and ends when the DNA content of the nucleus has doubled. 
The cell then enters G 2 , which lasts until the cell enters the final phase of division, mitotic (M). The M phase begins with 
nuclear envelope breakdown, chromosome condensation and formation of two identical sets of chromosomes which 
are separated into two new nuclei. This is followed by cell division (cytokinesis) in which each nuclei is separated into 
two daughter celts, which terminates the M phase and marks the beginning of interphase for the new cells. 

30 [0004] The sequence in which the cell cycle events proceed is tightly regulated such that the initiation of one cell 
cycle event is dependent upon the successful completion of the prior cell cycle event. The process of monitoring 
genome integrity and preventing cell cycle progress in the event of DNA damage has been described as a 'cell cycle 
checkpoint' (Hartwell, LH et at., Science, 246:629-634 (1989); Weinert et al., Genes and Dev., 8:652 (1994)]. Cell cycle 
checkpoints consist of signal transduction cascades which couple DNA damage detection to cell cycle progression. 

35 Checkpoints are control systems that coordinate cell cycle progression by influencing the formation, activation and sub- 
sequent inactivation of the cyclin-dependent kinases. Checkpoint enzymes are responsible for maintaining the order 
and fidelity of events of the cell cycle by blocking mitosis in response to unreplicated or damaged DNA. These enzymes 
prevent cell cycle progression at inappropriate times, maintain the metabolic balance of cells while the cell is arrested 
and in some instances can induce apoptosis (programmed cell death) when the requirements of the checkpoint have 

40 not been met (O'Connor, PM, Cancer Surveys, 29, 151-182 (1997); Nurse, P, Cell, 91, 865-867 (1997); Hartwell, LH et 
al, Science, 266, 1 821 -1 828 (1 994); Hartwell, LH et al., Science, 246, (1 989), supra ). 

[0005] One series of checkpoints monitors the integrity of the genome. Upon sensing DNA damage, these "DNA 
damage checkpoints'* block cell cycle progression in G-j & G 2 phases, and slow progression through S phase (O'Con- 
nor, PM, Cancer Surveys, 29 (1997), supra : Hartwell, LH et at, Science, 266, (1994), supra ). This action enables DNA 
45 repair to be completed before replication of the genome and subsequent separation of this genetic material into new 
daughter cell takes place. 

[0006] Various mutations associated with malignancy affect the cancer cells ability to regulate checkpoints, allow- 
ing cells with DNA damage the increased likelihood to continue replicating and to escape damage-mediated apoptosis 
These factors contribute to the genomic instability which drives the genetic evolution of human cancers and contributes 

so to the resistance of cancer cells to most current chemotherapy and radiotherapy intervention. 

[0007] Due to abnormalities in the p53 tumor suppressor pathway, most cancer cells lack a functional check- 
point control system. This makes them particularly vulnerable to abrogation of the last remaining barrier protecting them 
from the cancer killing effects of DNA damaging agents: the G 2 checkpoint. The G 2 DNA damage checkpoint ensures 
maintenance of cell viability by delaying progression into mitosis in cells that have suffered genomic damage. The G 2 

55 checkpoint is controlled by cell cycle checkpoint pathways which inhibit mitosis if previous events are incomplete or if 
the DNA is damaged. This regulation control system has been conserved from yeast to humans. Important in this con- 
served system is a kinase, Chk1 (or p56Chk1), which transduces signals from the DNA damage sensory complex to 
inhibit activation of the cyclin B/Cdc2 kinase which promotes mitotic entry (Peng, CY et al, Science, 277, 1501-1505 
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(1997); Sanchez Y, et al., Science, 277, 1497-1501 (1997); Walworth, N et al., Nature, 363(6427), 368-71 (May 27, 
1993); al-Khodairy et al., Mol Biol Cell, 5(2): 147-60 (Feb, 1994); Carr et al., CurrBiol., 5(10): 1179-90 (Oct. 1, 1995)). 
The repair checkpoint kinase, Chk1, regulates Cdc25, a phosphatase that activates Cdc2. Thus, Chk1 serves as the 
direct link between the G 2 checkpoint and the negative regulation of Cdc2. 
5 [0008] Inactivation of Chk1 has been shown to both abrogate G 2 arrest induced by DNA damage inflicted by either 
anticancer agents or endogenous DNA damage, as well as, result in preferential killing of the resulting checkpoint 
defective cells (Nurse, R Cell, 91, (1997), supra : Weinert, T, Science, 277, 1450-1451 (1997); Walworth, N et al., 
Nature, 363, (1993) supra : al-Khodairy et al., Molec. Biol. Cell, 5, (1994), supra : Wan, S et al., Yeast, 15(1 OA), 821-8 
(Jul. 1999)). 

10 [0009] The fact that cancer cells have also been shown to be more vulnerable to G 2 checkpoint abrogation has 
encouraged the pursuit of G 2 checkpoint abrogating drugs (Wang, Q et al., PNAS 96: 3706-371 1 (1 999); Fan, S et al., 
Cancer Res., 55, 1649-1654 (1995); Powell, SN et al., Cancer Res., 55, 1643-1648 (1995); Russell, KJ et al., Cancer 
Res., 55, 1639-1642 (1995); Wang, Q et al., J Natl Cancer Inst., 88, 956-967 (1996)). Such checkpoint abrogating 
drugs could improve the killing of tumors exposed to DNA damaging events including that inflicted by therapeutic 

15 agents, hypoxic-stress induced because of a limited blood supply (anti-angiogenic agents), or endogenous DNA dam- 
age arising as a consequence of a cancer cell's inherent genomic instability. Selective manipulation of checkpoint con- 
trol in cancer cells can afford broad utilization in cancer chemotherapeutic and radiotherapy regimens and may in 
addition, offer a common hallmark of human cancer "genomic instability" to be exploited as the selective basis for the 
destruction cancer cells. 

20 [0010] A number of lines of evidence place Chk1 as a pivotal target in DNA damage checkpoint control. However, 
Chk1 is a difficult enzyme to study because the full length protein is not the most active form of Chk1 . While others have 
examined the nucleotide and amino acid sequence of the full-length checkpoint kinase and estimated the location of the 
kinase domain, there is a need for the isolation and purification of the kinase domain of Chk1 and the maintenance of 
its catalytically active conformation. 

25 

SUMMARY OF THE INVENTION 

[0011] The generation, kinetic characterization, and structure determination of the kinase domain of the human 
Chk1 protein is disclosed herein. The domain begins between residues 1 and 16 and terminates between residues 265 
30 and 291 of the full length protein [SEQ ID NO. 2] which comprises 476 amino acids. The domain preferably extends 
from residues 1-265, more preferably from residues 1-289. 

[0012] The invention relates to an isolated, purified polynucleotide which encodes the active conformation of the 
human Chk1 kinase or an active kinase analog thereof. The polynucleotide may be natural or recombinant. 
[0013] The invention also relates to an isolated, soluble catalytically active polypeptide comprising the active con- 
35 formation of the human Chk1 kinase or an active kinase analog thereof. 

[0014] The invention encompasses both the polypeptide per se as well as salts thereof. As discussed in detail 
below, a high salt concentration (about 500 mM) in the buffer is used herein to prevent aggregation of peptide during 
purification and storage. 

[0015] The invention also relates to a crystal structure of the human Chk1 kinase in the active conformation 
40 resolved to at least 2.5Q. preferably 2.0O, more preferably 1.70- This structure provides a three-dimensional 
description of the target (human Chk1) for structure-based design of small molecule inhibitors thereof as therapeutic 
agents. 

[0016] The invention further relates to an expression vector for producing catalytically active human Chk1 kinase in 
a host cell. 

45 [001 7] The invention further relates to a host cell stably transformed and transfected with a polynucleotide encoding 
of the human Chk1 kinase, or fragment thereof; or an active kinase analog thereof, in a manner allowing the expression 
of the human Chk1 kinase in the active configuration. 

[0018] The present invention further discloses methods for screening candidate compounds using the molecular 
structure of the x-ray crystallography data to model the binding of candidate compounds. 

so [0019] The invention further provides a method for designing and screening potentially therapeutic compounds for 
the treatment of hyper-pro liferative or diseases related to proliferation, including but not limited to cancer and HIV infec- 
tion. The putative therapeutics can be screened for activities such as (1 ) potentiation of the cytotoxicity of DNA damag- 
ing agents such as synthetic or natural chemotherapeutic agents and ionizing or neutron radiation; (2) enhancement of 
the cytotoxicity of DNA synthesis inhibitors including antimetabolites, DNA chain terminators, or other mechanisms that 

55 would lead to the inhibition of DNA synthesis; (3) enhancement of the cytotoxicity of hypoxia as would occur within 
tumors due to a limited blood supply; and (4) inhibition of the ability of HIV to arrest cell cycle progression such as that 
induced by the VPR protein. Compounds that inhibit human Chk1 kinase activity or abrogate the G2 checkpoint can be 
used to treat or prevent the hype rpro life ration associated with cancer and HIV. 
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[0020] The present invention provides methods for identifying potential inhibitors of the human Chk1 protein kinase 
by de novo design of novel drug candidate molecules that bind to and inhibit human Chk1 protein kinase activity, or that 
improve their potency. The x-ray crystallographic coordinates disclosed herein, allow generation of 3-dimensional mod- 
els of the catalytic site and the drug binding site of the human Chk1 protein. De novo design comprises of the genera- 
5 tion of molecules via the use of computer programs which build and link fragments or atoms into a site based upon 
steric and electrostatic complementarity, without reference to substrate analog structures. The drug design process 
begins after the structure of the target (human Chk1 kinase) is solved to at least a resolution of 2.5_. Refinement of the 
structure to a resolution of 2.0 A or better with fixed water molecules in place provides more optimal conditions to under- 
take drug design. 

w [0021] The invention further provides a method for computational modeling of the kinase domain of human Chk1 , 
such a model being useful in the design of compounds that interact with this domain. The method involves crystallizing 
the Chk1 kinase in the catalytically active configuration; resolving the x-ray structure of said active kinase, particularly 
the kinase domain and binding site of active Chk1; and applying the data generated from resolving the x-ray structure 
to a computer algorithm capable of generating a three dimensional model of the kinase domain and binding site suitable 

75 for use in designing molecules that will act as agonists or antagonists to the polypeptide. An iterative process can then 
be applied to various molecular structures using the computer-generated model to identify potential agonists or antag- 
onists of the Chk1 kinase. Inhibitors of the kinase can serve as lead compounds for the design of potentially therapeutic 
compounds for the treatment of diseases or disorders associated with hyperproliferation or related to proliferation, such 
as cancer and HIV. 

20 [0022] The invention further provides a process where the human Chk1 protein kinase is modified by deletion of the 
C-terminal portion of the protein so as to impart favorable physical characteristics of the resulting polypeptide. The 
kinase domain is suitable for analysis by nuclear magnetic resonance, high throughput screening, biochemical charac- 
terizations, x-ray crystallography, colorimetry and other diagnostic means. The most preferred deletion fragment 
extends from residue 1 to residue 289. 

25 [0023] The invention further provides screening methods for use in the drug design process of potential agents to 
the human Chk1 protein kinase by de novo design of novel drug candidate molecules with potentially nanomolar poten- 
cies. The x-ray crystallographic coordinates disclosed based on the kinase domain of the human Chk1 protein will allow 
the generation of 3-dimensional models of the active binding sites of the human Chk1 protein. 
[0024] The invention further provides a method for rapidly screening compounds to identify those compounds that 

30 inhibit Chk1 kinase or core structure for further Chk1 inhibitor design. The high throughput-screening assay is capable 
of being fully automated on robotic workstations. The assay may be radioactive. However, in a preferred embodiment 
the assay is a non -radioactive ELISA. In a more preferred embodiment, the assay is an ELISA that utilizes a novel anti- 
body, rabbit anti-phosphosyntide, to specifically detect the product of the Chk1 kinase reaction in which biotin-syntide 
is the substrate. However, the basis of the assay includes the ability to use other substrates detectable by anti-phospho- 

35 peptide/ protein antibodies. The assay may be used to screen large collections of compound libraries to discover Chk1 
kinase inhibitors and potential lead compounds for the development of Chk1 kinase selective anticancer compounds. 
The assay finds utility in the screening of other syntide substrate kinase reactions involving kinases of analogous activ- 
ity to Chk1. 

40 BRIEF DESCRIPTION OF THE DRAWINGS 
[0025] 

Figure 1. The G 2 DNA damage checkpoint mechanism in fission yeast (Furnari et al., Science, 277: 1495-1497 
45 (Sep. 5, 1997). 

Figure 2. Sequence alignment of Chk1 kinase domains of human (hs) (SEQ ID NO: 2), mouse (mm) (SEQ ID NO: 
18), Xenopus (xl) (SEQ ID NO: 19) fruit fly (dm) (SEQ ID NO: 20), C. elegans (ce) (SEQ ID NO: 21), S. cerevisiae 
(sc) (SEQ ID NO: 22), and S. pombe (sp) (SEQ ID NO: 23). Secondary structural elements of human Chk1 are 
so shown above the alignment. The numbers of amino acids are shown on the right. Invariant residues among these 
species are in red and human Chk1 residues that also conserved in other species are in cyan. 

Figure 3. The homology model of Chk1 kinase depicting the activation loop and its relationship to the catalytic loop 
and C helix. The Chk1 N and C-terminal lobes are shown. The fragments corresponding to the Chk1 C-helix are 
55 residues 50-58; the Chk1 catalytic loop are residues 129-132; and the Chk1 activation loop are residues 148-170. 

Figure 4. The purification scheme for Chk1 kinase domain 1 -289. 
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Figure 5. The structure of human Chk1 kinase domain identified using the crystal resolved to 1.7 A. A ribbon dia- 
gram of the binary complex structure of Chk1 with AMP-PNP showing the secondary structural elements and the 
loops discussed in the text. The a-helices are shown in blue, the (3-strands in cyan, the catalytic loop in orange, the 
activation loop in red. AMP-PNP and sulfate ion are shown as ball and stick models. The termini are denoted by N 
5 and C. 

Figure 6. Catalytic site of Chk1 . Cross section of the catalytic site of human Chk1 with AMP-PNP. Protein C oc-rib- 
bon representations are shown in purple for Chk1 . The side chains of the catalytic site residues are shown as ball 
and stick models and are color-coded by atom type: carbon, green; nitrogen, blue; oxygen, red. The distances (_) 
10 along the dotted lines between the catalytic site residues are shown. 

Figure 7. Molecular surface of the Chk1 with modeled CDC25C peptide. The molecular surface of Chk1 is colored 
as follows: basic side chains are shown in blue, acidic side chains in red, and non-polar side chains in violate. 
CDC25C peptide (residues 21 1-219) is shown as tick model and color-coded by atom type: carbon, green; nitro- 
15 gen, blue; oxygen, red; sulfur, yellow. 

Figure 8. Stereoview of representative electron density map. Figure 8A shows a stereoview of a representative por- 
tion of the experimental density at 1 .5_ calculated to 3.0_ with the use of phases after solvent flattening. Super- 
imposed on the density is the final refined model. Figure 8B shows a difference Fourier map calculated with native 
20 model-derived phases and coefficients IFO(AMP-PNP)l-IFO(native/apoenzyme)l to the diffraction of 1 .7_and con- 
toured at 2.5_. The triphosphate moiety of AMP-PNP is disordered and is omitted from the model. No Mg 2+ ions 
are observed. 

Figure 9. Representation of the Chk1 binding sites, showing specifically the specificity pocket, the ATP binding site, 
25 and the Donor-Acceptor-Donor binding motif. 

Figure 10. The high throughput ELISA protocol. 

Figure 11. The Chk1 crystal coordinates for the apoenzyme (isolated active Chk1 — Figure 11 A) and the binary 
30 complex (Chk1 complexed with AMP-PNP, an ATP analog — Figure 11B) including the coordinates of the fixed 
water molecules. 

DETAILED DESCRIPTION OF THE INVENTION 

35 [0026] DNA damage induces the arrest of the cell cycle at the G 2 checkpoint. The G 2 DNA damage checkpoint 
ensures maintenance of cell viability by delaying progression into mitosis in cells which have suffered genomic damage. 
The G 2 checkpoint is controlled by cell cycle checkpoint pathways which have been extensively studied (Hartwell, LH 
et al., Science, 246 (1989), supra : Nurse, P et al., Nat Med, 4 (10): 1103-6 (Oct 1998); Peng et al., Science, 277, 
(1997), supra : Furnari et al., Science, 277: 1495-1497 (Sep. 5, 1997); Zeng et al., Nature 395 (6701 ):507-510 (Oct. 1, 

40 1998); Martinho et al., EMBO J, 17(24):7239-49 (Dec. 15, 1998); Nakajo et al., Dev. Biol. 207(2):432-44 (Mar. 15, 
1 999); Carr et al., Curr Biol., 5 (1 995), supra). The model of the checkpoint mechanism in fission yeast is shown in Fig- 
ure 1, Furnari, et al., Science, (1997), supra . As mentioned above, the regulation control system is highly conserved 
from yeast to humans. 

[0027] DNA damage activates the checkpoint pathway by inhibiting the dephosphorylation of the mitotic kinase 
45 Cdc2 at the tyrosine-15 residue [Cdc2 (Y 15 -P0 4 )], thereby inhibiting its mitotic initiating activity and arresting the cell 
cycle. This process is referred to as inhibitory phosphorylation. In order for mitosis to proceed, Cdc2 must be dephos- 
phorylated, returning it to its active form. Phosphorylated Cdc2 is the substrate of Cdc25. Cdc25 is a dual specificity 
protein phosphatase that controls entry into mitosis by dephosphorylating the protein kinase Cdc2. In fission yeast, 
DNA damage also results in the activation of Rad3, a kinase related to the ATM/ATR proteins. Rad3 initiates the Chk1 
so response; the phosphorylation of Chk1 is a Rad3 dependent process (Martinho et al., EMBO J, 17 (1 998), supra : Fur- 
nari et al., Science, 277 (1 997), supra ). Phosphorylated (active) Chk1 phosphorylates the mitotic inducer Cdc25 at the 
serine-216 residue of human Cdc25 [Cdc25 (S 216 -P0 4 )]. Phosphorylation of Cdc25 inhibits the function of the phos- 
phatase in the dephosphorylation of Cdc2, an event required for mitosis to proceed. Throughout interphase but not in 
mitosis, Cdc25 is phosphorylated at the serine-216 residue and bound to members of the highly conserved and ubiq- 
55 uitously expressed family of 14-3-3 proteins. Prevention of serine-216 phosphorylation prevents 14-3-3 binding, per- 
turbing mitotic timing and allowing cells to escape the G 2 checkpoint arrest induced by either unreplicated DNA or 
radiation induced damage. 

[0028] A majority of currently accepted cancer treatments involve the induction of DNA damage including the 
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administration of anticancer agents, chemotherapeutic agents, and radiation therapy. Cancer cells frequently become 
resistant to such therapies. It is suspected that such resistance is related to the innate ability of the cancer cells to arrest 
and repair the damage induced. If the cancer ceil was unable to arrest and repair, mitosis would proceed with the DNA 
damage intact. The downstream result would presumably be cell death as a result of the DNA damage. 

5 [0029] Treatments that include a mechanism for abrogating the endogenous checkpoint pathway and repair proc- 
ess would presumably be more effective in killing cancer cells. As many cancer cells already lack a checkpoint con- 
trol system, a therapy that involved the inhibition of the G 2 checkpoint would presumably force the cancer cells to 
proceed through mitosis without any feedback arrest and repair process. Hence, there is a clear utility for the inhibition 
of the activity of Chk1 , a pivotal kinase in the G 2 checkpoint pathway. As many of the same events that regulate the G 2 

w arrest subsequent to DNA damage also regulate the S phase delay following DNA damage, the inhibition of Chk1 finds 
utility in the regulation of S phase as well. 

[0030] The human Chk1 sequence of amino acids 1 to 476 is available through GenBank. Full length or segments 
of human Chk1 cDNA corresponding to codon 1-427, 1-265, and 1-289 were separately amplified by PCR. Each was 
tagged at its 3'-end with six histidine codons and cloned into an expression plasmid for protein production using a Bac- 
15 ulovirus/insect cell expression system. The protein was expressed in insect Hi-5 cells and purified by a combination of 
ion-exchange and affinity column chromatography. It was found that a high concentration of salt (-500 mM levels) was 
required for keeping the purified Chk1 kinase domain from forming a precipitate. 

[0031] The kinase activity of the hChkl was determined by monitoring the ADP production through enzymatic 
actions of pyruvate kinase and lactate dehydrogenase. The Chk1 kinase domain containing amino acids 1 -289 showed 
20 higher enzymatic activity than the full length protein. Unlike the other forms of Chk1 which have proven difficult to work 
with (isolate, purify, crystallize, etc), the 1-289 kinase domain form of the human Chk1 enzyme facilitated crystallogra- 
phy, enzyme characterization, and high throughput screening of inhibitors. In particular, the Chk1 kinase domain was 
used to determine its 3-dimensional structure, which provides unique structural information for inhibitor design for ther- 
apeutic development. 

25 [0032] As used herein, the abbreviation 'hChkl' refers to the polynucleotide encoding the human effector check- 
point kinase serving as a DNA damage/replication checkpoint kinase. The nucleic acid sequence of the polynucleotide 
encoding the full length protein of human Chk1 was published in Science by Sanchez et al. (Science, 277 (5331): 1497- 
1501 (1997)) and published in GenBank on September 9, 1997 (AF016582). The nucleic acid sequence described 
therein is provided herein, shown in SEQ ID NO. 1. The corresponding peptide sequence of the full length protein is 

30 provided herein, shown in SEQ ID NO. 2. This peptide sequence was submitted to GenBank by Flaggs et al. on Novem- 
ber 3, 1997 and released on December 13, 1997 (AF032874). The protein kinase was further described by Flaggs et 
al. in Current Biology (Curr. Biol., 7(12):977-986, (1997)). 

[0033] Using homology tools to examine the nucleotide and peptide sequence of Chk1 , scientists have attempted 
to estimate the location of the kinase domain. However, the exact location of the catalytically active kinase domain has 
35 been difficult to experimentally determine, primarily as no one has ever reported isolating the kinase domain in its active 
configuration. Previous publications have indicated that the kinase domain extends from AA 16 to AA 264 
(W099/1 1 1 795, published March 1 1 , 1999, at page 7, line 3) of SEQ ID NO. 2. 

[0034] We have found that the catalytic kinase domain begins between AA1 and 16 and terminates between AA265 
and AA291 of SEQ ID NO. 2. We further discovered that vector-driven protein yield is dramatically increased when a 

40 fragment extending from AA1 to AA289 (dubbed KH289) is used. 

[0035] There are 22 known amino acids but 64 possible permutations of nucleic acid triplets, called "codons". Many 
amino acids are specified by more than one codon, a phenomenon called degeneracy. Due to the degeneracy of the 
genetic code, there are many functionally equivalent nucleic acid sequences that can encode the same protein. The 
active human Chk1 kinase set forth in SEQ ID NO.2 can clearly be encoded by multiple nucleotide sequences and is 

45 not limited to the cDN A sequence set forth in SEQ ID NO. 1 . For example, both UUU and UUC code for a phenylalanine 
while serine is encoded by UCU, UCC, UCA, UCG, AGU, and AGC [Molecular Biology of the Gene . 4 th edition, Watson, 
J.D. et al., editors (1987) at pages 437-438]. Functionally equivalent sequences can readily be prepared using known 
methods such as modified primer PCR, site-directed mutagenesis, and chemical synthesis. Such functional equivalents 
are within the scope of this invention. 

so [0036] In the examples of the present invention, the full length form of human Chk1 protein kinase (AA 1-476) is 
referred to as KH476. Fragments thereof are identified by the amino acid sequence. For example, the human Chk1 
kinase domain (AA 1-289) is referred to as KH289 Other kinase domain sequences are referred to by amino acid num- 
bering in a similar manner. 

55 A. Peptides. Proteins and Antibodies 

[0037] As used herein, the terms "kinase" and "protein kinase" refer to enzymes that catalyze the transfer of a phos- 
phate residue from a nucleoside triphosphate to an amino acid side chain in selected targets. The covalent phosphor- 
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ylation in turn regulates the activity of the target protein. In addition, phosphorylation frequently acts as the signal that 
triggers a particular process or reaction, playing an integral part in cellular regulation and control mechanisms. Clearly, 
inappropriate or unregulated phosphorylation can result in errors in cell signaling and the associated cell cycle and reg- 
ulation processes. Most protein kinases are highly substrate specific. 
5 [0038] As used herein, a peptide is said to be "isolated" or "purified" when it is substantially free of homologous cel- 
lular material or chemical precursors or other chemicals. The peptides of the present invention can be purified to homo- 
geneity or other degrees of purity. The level of purification will be based on the intended use. 

[0039] In some uses, "substantially free of cellular material" includes preparations of the peptide having less than 
about 30% (by dry weight) other proteins (i.e., contaminating protein), less than about 20% other proteins, less than 
10 about 10% other proteins, or less than about 5% other proteins. When the peptide is recombinantly produced, it can 
also be substantially free of culture medium, i.e., culture medium represents less than about 20% of the volume of the 
protein preparation. 

[0040] The language "substantially free of chemical precursors or other chemicals" includes preparations of the 
peptide in which it is separated from chemical precursors or other chemicals that are involved in its synthesis. In one 
15 embodiment; the language "substantially free of chemical precursors or other chemicals" includes preparations of the 
kinase peptide having less than about 30% (by thy weight) chemical precursors or other chemicals, preferably less than 
about 20% chemical precursors or other chemicals, more preferably less than about 1 0% chemical precursors or other 
chemicals, or most preferably less than about 5% chemical precursors or other chemicals. 

[0041] The isolated kinase described herein can be purified from cells that naturally express it, purified from cells 
20 that have been altered to express it (recombination), or synthesized using known protein synthesis methods. For exam- 
ple, a nucleic acid molecule encoding the protein kinase is cloned into an expression vector, the expression vector intro- 
duced into a host cell and the protein expressed in the host cell. The protein can then be isolated from the cells by an 
appropriate purification scheme using standard protein purification techniques. Many of these techniques are described 
in detail below. 

25 [0042] The present invention also provides catalytically active variants of the peptides of the present invention, such 
as allelic/sequence variants of the peptides, non-naturally occurring recombinantly derived variants of the peptides, and 
orthologs and paralogs of the peptides. Such variants can be generated using techniques that are known by those 
skilled in the fields of recombinant nucleic acid technology and protein biochemistry. 

[0043] Such variants can readily be identified/made using molecular techniques and the sequence information dis- 
30 closed herein. Further, such variants can readily be distinguished from other peptides based on sequence and/or struc- 
tural homology to the peptides of the present invention. The degree of homo logy/identity present will be based primarily 
on whether the peptide is a functional (active) variant or non -functional (inactive) variant, the amount of divergence 
present in the paralog family and the evolutionary distance between the orthologs. 

[0044] To determine the percent identity of two amino acid sequences or two nucleic acid sequences, the 

35 sequences are aligned for optimal comparison purposes (e.g., gaps can be introduced in one or both of a first and a 
second amino acid or nucleic acid sequence for optimal alignment and non-homologous sequences can be disregarded 
for comparison purposes). In a preferred embodiment, the length of a reference sequence aligned for comparison pur- 
poses is at least 30%, 40%, 50%, 60%, 70%, 80%, or 90% or more of the length of the reference sequence. The amino 
acid residues or nucleotides at corresponding amino acid positions or nucleotide positions are then compared. When a 

40 position in the first sequence is occupied by the same amino acid residue or nucleotide as the corresponding position 
in the second sequence, then the molecules are identical at that position (as used herein amino acid or nucleic acid 
'identity' is equivalent to amino acid or nucleic acid 'homology'). The percent identity between the two sequences is a 
function of the number of identical positions shared by the sequences, taking into account the number of gaps, and the 
length of each gap, which need to be introduced for optimal alignment of the two sequences. 

45 [0045] The comparison of sequences and determination of percent identity and similarity between two sequences 
can be accomplished using a mathematical algorithm. (Computational Molecular Biology, Lesk, A.M., ed., Oxford Uni- 
versity Press, New York, 1988; Biocomputing: Informatics and Genome Projects, Smith, D.W., ed., Academic Press, 
New York, 1993; Computer Analysis of Sequence Data, Part 1, Griffin, A.M., and Griffin, H.G., eds., Humana Press, 
New Jersey, 1994; Sequence Analysis in Molecular Biology, von Heinje, G., Academic Press, 1987; and Sequence 

50 Analysis Primer, Gribskov, M. and Devereux, J., eds., M Stockton Press, New York, 1991). In a preferred embodiment, 
the percent identity between two amino acid sequences is determined using the Needleman and Wunsch (J. MoL Biol. 
(48):444-453 (1970)) algorithm which has been incorporated into commercially available computer programs, such as 
GAP in the GCG software package, using either a Blossom 62 matrix or a PAM250 matrix, and a gap weight of 1 6, 14, 
12, 10, 8, 6, or 4 and a length weight of 1, 2, 3, 4, 5, or 6. In yet another preferred embodiment, the percent identity 

55 between two nucleotide sequences can be determined using the commercially available computer programs including 
the GAP program in the GCG software package (Devereux, J., et a/., Nucleic Acids Res. 12(1):387 (1984)), the NWS 
gap DNA CMP matrix and a gap weight of 40, 50, 60, 70, or 80 and a length weight of 1 , 2, 3, 4, 5, or 6. In another 
embodiment, the percent identity between two amino acid or nucleotide sequences is determined using the algorithm 
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of E. Meyers and W. Miller (CABIOS, 4:1 1 -17 (1989)) which has been incorporated into commercially available compu- 
ter programs, such as ALIGN (version 2.0), using a PAM1 20 weight residue table, a gap length penalty of 12 and a gap 
penalty of 4. 

[0046] The nucleic acid and protein sequences of the present invention can further be used as a "query sequence" 
5 to perform a search against sequence databases to, for example, identify other family members or related sequences. 
Such searches can be performed using commercially available search engines, such as the N BLAST and XBLAST pro- 
grams (version 2.0) of Altschul, et at. (J. Mol. Biol. 215:403-10 (1990)). Nucleotide searches can be performed with 
such programs to obtain nucleotide sequences homologous to the nucleic acid molecules of the invention. Protein 
searches can be performed with such programs to obtain amino acid sequences homologous to the proteins of the 
10 invention. To obtain gapped alignments for comparison purposes, Gapped BLAST can be utilized as described in Alts- 
chul et al. (Nucleic Acids Res. 25(1 7): 3389-3402 (1997)). 

[0047] Full-length clones comprising one of the peptides of the present invention can readily be identified as having 
complete sequence identity to one of the kinases of the present invention as well as being encoded by the same genetic 
locus as the kinase provided herein. 

75 [0048] Allelic variants of a peptide can readily be identified as having a high degree (significant) of sequence 
homology/identity to at least a portion of the peptide as well as being encoded by the same genetic locus as the kinase 
peptide provided herein. As used herein, two proteins (or a region of the proteins) have significant homology when the 
amino acid sequences are typically at least about 70-75%, 80-85%, and more typically at least about 90-95% or more 
homologous. A significantly homologous amino acid sequence, according to the present invention, will be encoded by 

20 a nucleic acid sequence that will hybridize to a peptide encoding nucleic acid molecule under siringent conditions as 
more fully described below. 

[0049] Paralogs of a peptide can readily be identified as having some degree of significant sequence homol- 
ogy/identity to at least a portion of the kinase peptide, as being encoded by a gene from Drosophila, and as having sim- 
ilar activity or function. Two proteins will typically be considered paralogs when the amino acid sequences are typically 
25 at least about 70-75%, 80-85%, and more typically at least about 90-95% or more homologous through a given region 
or domain. Such paralogs will be encoded by a nucleic acid sequence that will hybridize to a kinase peptide encoding 
nucleic acid molecule under stringent conditions as more fully described below. 

[0050] Orthologs of a kinase peptide can readily be identified as having some degree of significant sequence 
homology/identity to at least a portion of the kinase peptide as well as being encoded by a gene from another organism. 

30 Preferred orthologs will be isolated from mammals, preferably human, for the development of human therapeutic tar- 
gets and agents, or other invertebrates, particularly insects of economical/agriculture importance, e.g. members of the 
Lepidopteran and Coleopteran orders, for the development of insecticides and insecticidal targets. Such orthologs will 
be encoded by a nucleic acid sequence that will hybridize to a kinase peptide encoding nucleic acid molecule under 
moderate to stringent conditions, as more fully described below, depending on the degree of relatedness of the two 

35 organisms yielding the proteins. 

[0051] Non-natu rally occurring variants of the kinases of the present invention can readily be generated using 
recombinant techniques. Such variants include, but are not limited to deletions, additions and substitutions in the amino 
acid sequence of the kinase. For example, one class of substitutions are conserved amino acid substitution. Such sub- 
stitutions are those that substitute a given amino acid in a kinase peptide by another amino acid of like characteristics. 

40 Typically seen as conservative substitutions are the replacements, one for another, among the aliphatic amino acids 
Ala, Val, Leu, and lie; interchange of the hydroxyl residues Ser and Thr; exchange of the acidic residues Asp and Glu; 
substitution between the amide residues Asn and Gin; exchange of the basic residues Lys and Arg; and replacements 
among the aromatic residues Phe, Tyr. Guidance concerning which amino acid changes are likely to be phenotypically 
silent are found in Bowie etal., Science 247:1306-1310 (1990). 

45 [0052] Variant kinases can be fully functional or can lack function in one or more activities. Fully functional variants 
typically contain only conservative variation or variation in non-critical residues or in non-critical regions. Functional var- 
iants can also contain substitution of similar amino acids, which result in no change or an insignificant change in func- 
tion. Alternatively, such substitutions may positively or negatively affect function to some degree. 
[0053] Non-functional variants typically contain one or more non-conservative amino acid substitutions, deletions, 

so insertions, inversions, or truncation or a substitution, insertion, inversion, or deletion in a critical residue or critical 
region. 

[0054] Amino acids that are essential for function can be identified by methods known in the art, such as site- 
directed mutagenesis or alanine-scanning mutagenesis (Cunningham e/a/., Science 244:1081-1085 (1989)). The lat- 
ter procedure introduces single alanine mutations at every residue in the molecule. The resulting mutant molecules are 
55 then tested for biological activity such as receptor binding or in vitro proliferative activity. Sites that are critical for binding 
can also be determined by structural analysis such as x-ray crystallography, nuclear magnetic resonance or photoaffin- 
ity labeling (Smith etal, J. Mol. Biol. 224:899-904 (1992); de Vos etal. Science 255:306-31 2 (1992)). Accordingly, the 
protein kinases of the present invention also encompass derivatives or analogs in which a substituted amino acid resi- 
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due is not one encoded by the genetic code; in which a substituent group is included; in which the mature polypeptide 
is fused with another compound, such as a compound to increase the half-life of the polypeptide (for example, polyeth- 
ylene glycol); or in which the additional amino acids are fused to the mature polypeptide, such as a leader or secretory 
sequence or a sequence for purification of the mature polypeptide or a pro-protein sequence. 

5 [0055] The present invention further provides for functional, active fragments of the Chk1 kinase domain. As used 
herein, a fragment comprises at least 8 or more contiguous amino acid residues from the protein kinase. Such frag- 
ments can be chosen based on the ability to retain one or more of the biological activities of the kinase or could be cho- 
sen for the ability to perform a function, e.g. act as an immunogen. Particularly important fragments are catalytically 
activate fragments, peptides which are, for example about 8 or more amino acids in length. Such fragments will typically 

w comprise a domain or motif of the kinase, e.g., active site or binding site. Further fragments contemplated by the 
present invention include, but are not limited to, domain or motif containing fragments, soluble peptide fragments, and 
fragments containing immunogenic structures. Predicted domains and functional sites available to those of skill in the 
art (e.g., by PROSITE analysis). 

[0056] Polypeptides often contain amino acids other than the 20 amino acids commonly referred to as the 20 nat- 
75 urally-occurring amino acids. Further, many amino acids, including the terminal amino acids, may be modified by natu- 
ral processes, such as processing and other post-translational modifications, or by chemical modification techniques 
known in the art. Common modifications that occur naturally in polypeptides are described in basic texts, detailed mon- 
ographs, and the research literature, and they are known to those of skill in the art. 

[0057] Known modifications include, but are not limited to, acetylation, acylation, ADP-ribosylation, amidation, cov- 
20 alent attachment of flavin, covalent attachment of a heme moiety, covalent attachment of a nucleotide or nucleotide 
derivative, covalent attachment of a lipid or lipid derivative, covalent attachment of phosphotidylinositol, cross-linking, 
cyclization, disulfide bond formation, demethylation, formation of covalent crosslinks, formation of cystine, formation of 
pyroglutamate, formylation, gamma carboxylation, glycosylation, GPI anchor formation, hydroxylation, iodination, meth- 
ylation, myristoylation, oxidation, proteolytic processing, phosphorylation, phenylation, racemization, selenoylation, sul- 
25 fation, transfer-RNA mediated addition of amino acids to proteins such as arginylation, and ubiquiti nation. Such 
modifications are known to those of skill in the art and have been described in great detail in the scientific literature. Sev- 
eral particularly common modifications, glycosylation, lipid attachment; sulfation, gamma-carboxylation of glutamic acid 
residues, hydroxylation and ADP-ribosylation, for instance, are described in most basic texts, such as Proteins - Struc- 
ture and Molecular Properties, 2nd Ed., T.E. Creighton, W. H. Freeman and Company, New York (1 993). Many detailed 
30 reviews are available on this subject; such as by Wold, F, Posttranslational Covalent Modification of Proteins, B.C. 
Johnson, Ed., Academic Press, New York 1-12 (1983); Seifter etal. (Meth. EnzymoL 182: 626-646 (1990)) and Rattan 
era/. (Ann. N.Y.AcadSci. 653:48-62 (1992)). 

[0058] The peptides of the present invention can be attached to heterologous sequences to form chimeric or fusion 
proteins. Such chimeric and fission proteins comprise a peptide operatively linked to a heterologous protein having an 

35 amino acid sequence not substantially homologous to the kinase peptide. "Operatively linked" indicates that the peptide 
and the heterologous protein are fused in-frame. The heterologous protein can be fused to the N-terminus or C-termi- 
nus of the kinase peptide. The two peptides linked in a fusion peptide are typically derived from two independent 
sources, and therefore a fusion peptide comprises two linked peptides not normally found linked in nature. The two pep- 
tides may be from the same or different genome. 

40 [0059] In some uses, the fusion protein does not affect the activity of the peptide perse. For example, the fusion 
protein can include, but is not limited to, enzymatic fusion proteins, for example beta-galactosidase fusions, yeast two- 
hybrid GAL fusions, poly-His fusions, MYC-tagged, Hl-tagged and Ig fusions. Such fusion proteins, particularly poly-His 
fusions, can facilitate the purification of recombinant kinase peptide. In certain host cells (e.g., mammalian host cells), 
expression and/or secretion of a protein can be increased by using a heterologous signal sequence. 

45 [0060] A chimeric or fusion protein can be produced by standard recombinant DNA techniques. For example, DNA 
fragments coding for the different protein sequences are ligated together in-frame in accordance with conventional tech- 
niques. In another embodiment; the fusion gene can be synthesized by conventional techniques including automated 
DNA synthesizers. Alternatively, PCR amplification of gene fragments can be carried out using anchor primers which 
give rise to complementary overhangs between two consecutive gene fragments which can subsequently be annealed 

50 and re-amplified to generate a chimeric gene sequence (see Ausubel et at., Current Protocols in Molecular Biology, 
1 992). Moreover, many expression vectors are commercially available that already encode a fusion moiety (e.g., a GST 
protein). A kinase peptide-encoding nucleic acid can be cloned into such an expression vector such that the fusion moi- 
ety is linked in-frame to the kinase peptide. 

[0061] Herein, the term 'antibody 1 refers to a polypeptide or group of polypeptides which are comprised of at least 
55 one antibody combining site or binding domain, said binding domain or combining site formed from the folding of vari- 
able domains of an antibody molecule to form three dimensional binding spaces with an internal surface shape and 
charge distribution complementary to the features of an antigen epitope. The term encompasses immunoglobulin mol- 
ecules and immunologically active portions of immunoglobulin molecules, such as molecules that contain an antibody 
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combining site or paratope. Exemplary antibody molecules are intact immunoglobulin molecules, substantially intact 
immunoglobulin molecules and portions of an immunoglobulin molecule, including those known in the art as Fab, FabB, 
F(abB) 2 and F(v). 

5 B. Nucleic Acids and Polynucleotides 

[0062] The present invention provides isolated nucleic acid molecules that encode the functional, active kinases of 
the present invention. Such nucleic acid molecules will consist of, consist essentially of, or comprise a nucleotide 
sequence that encodes one of the kinase peptides of the present invention, an allelic variant thereof, or an ortholog or 
10 paralog thereof. 

[0063] As used herein, an "isolated" nucleic acid molecule is one that is separated from other nucleic acid present 
in the natural source of the nucleic acid. Preferably, an "isolated" nucleic acid is free of sequences which naturally flank 
the nucleic acid (i.e., sequences located at the 5' and 3' ends of the nucleic acid) in the genomic DNA or cDNA of the 
organism from which the nucleic acid is derived. However, there can be some flanking nucleotide sequences, for exam- 
15 pie up to about 5KB, particularly contiguous peptide encoding sequences and peptide encoding sequences within the 
same gene but separated by introns in the genomic sequence. The important point is that the nucleic acid is isolated 
from remote and unimportant flanking sequences such that it can be subjected to the specific manipulations described 
herein such as recombinant expression, preparation of probes and primers, and other uses specific to the nucleic acid 
sequences. 

20 [0064] Moreover, an "isolated" nucleic acid molecule, such as a cDNA molecule, can be substantially free of other 
cellular material, or culture medium when produced by recombinant techniques, or chemical precursors or other chem- 
icals when chemically synthesized. However, the nucleic acid molecule can be fused to other coding or regulatory 
sequences and still be considered isolated. 

[0065] For example, recombinant DNA molecules contained in a vector are considered isolated. Further examples 
25 of isolated DNA molecules include recombinant DNA molecules maintained in heterologous host cells or purified (par- 
tially or substantially) DNA molecules in solution. Isolated RNA molecules include in vivo or in vitro RNA transcripts of 
the isolated DNA molecules of the present invention. Isolated nucleic acid molecules according to the present invention 
further include such molecules produced synthetically. 

[0066] The preferred classes of nucleic acid molecules that are comprised of the nucleotide sequences of the 
30 present are the full-length cDNA molecules and genes and genomic clones since some of the nucleic acid molecules 
provided in SEQ ID NO. 1 are fragments of the complete gene that exists in nature. A brief description of how various 
types of these nucleic acid molecules can be readily made/isolated is provided herein. 

[0067] Full-length genes may be cloned from known sequence using any one of a number of methods known in the 
art. For example, a method which employs XL-PCR (Perkin-Elmer, Foster City, Calif.) to amplify long pieces of DNA may 

35 be used. Other methods for obtaining full-length sequences are known in the art. 

[0068] The isolated nucleic acid molecules can encode the functional, active kinase plus additional amino or car- 
boxyl-terminal amino acids, such as those that facilitate protein trafficking, prolong or shorten protein half-life or facili- 
tate manipulation of a protein for assay or production, among other things. The isolated nucleic acid molecules include, 
but are not limited to, the sequence encoding the active kinase alone or in combination with coding sequences, such as 

40 a leader or secretory sequence (eg., a pre-pro or pro-protein sequence), the sequence encoding the active kinase, with 
or without the additional coding sequences, plus additional non-coding sequences, for example introns and non-coding 
5' and 3' sequences such as transcribed but non-translated sequences that play a role in transcription, mRNA process- 
ing (including splicing and polyadenylation signals), ribosome binding and stability of mRNA. In addition, the nucleic 
acid molecule may be fused to a marker sequence encoding, for example, a peptide that facilitates purification. 

45 [0069] Isolated nucleic acid molecules can be m the form of RNA, such as mRNA, or m the form DNA, including 
cDNA and genomic DNA, obtained by cloning or produced by chemical synthetic techniques or by a combination 
thereof The nucleic acid, especially DNA, can be double-stranded or single-stranded. Single-stranded nucleic acid can 
be the coding strand (sense strand) or the non-coding strand (anti-sense strand). 

[0070] The invention further provides nucleic acid molecules that encode functional fragments or variants of the 
so active kinases of the present invention. Such nucleic acid molecules may be naturally occurring, such as allelic variants 
(same locus), paralogs (different locus), and orthologs (different organism), or may be constructed by recombinant DNA 
methods or by chemical synthesis. Such non-naturally occurring variants may be made by mutagenesis techniques, 
including those applied to nucleic acid molecules, cells, or organisms. Accordingly, as discussed above, the variants 
can contain nucleotide substitutions, deletions, inversions and insertions. Variation can occur in either or both the cod- 
55 ing and non-coding regions. The variations can produce both conservative and no n -conservative amino acid substitu- 
tions. 

[0071] A fragment comprises a contiguous nucleotide sequence greater than 12 or more nucleotides. Further, a 
fragment could be at least 30, 40, 50, 100, 250 or 500 nucleotides in length. The length of the fragment will be based 
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on its intended use. For example, the fragment can encode epitope bearing regions of the peptide, or can be useful as 
DNA probes and primers. Such fragments can be isolated using the known nucleotide sequence to synthesize an oli- 
gonucleotide probe. A labeled probe can then be used to screen a cDNA library, genomic DNA library, or mRNA to iso- 
late nucleic acid corresponding to the coding region. Further, primers can be used in PCR reactions to clone specific 
5 regions of gene. 

[0072] A probe/primer typically comprises substantially a purified oligonucleotide or oligonucleolide pair. The oligo- 
nucleotide typically comprises a region of nucleotide sequence that hybridizes under stringent conditions to at least 
about 12, 20, 25, 40, 50 or more consecutive nucleotides. 

[0073] Orthologs, homologs, and allelic variants can be identified using methods known in the art. As described 
w above, these variants comprise a nucleotide sequence encoding a peptide that is typically 60-65%, 70-75%, 80-85%, 
and more typically at least about 90-95% or more homologous to the nucleotide sequence provided in SEQ ID NO. 1 
or a fragment of this sequence. Such nucleic acid molecules can readily be identified as being able to hybridize under 
moderate to stringent conditions, to the nucleotide sequence shown in SEQ ID NO. 1 or a fragment of the sequence. 
[0074] As used herein, the term "hybridizes under stringent conditions" is intended to describe conditions for 
15 hybridization and washing under which nucleotide sequences encoding a peptide at least 50-55% homologous to each 
other typically remain hybridized to each other. The conditions can be such that sequences at least about 65%, at least 
about 70%, or at least about 75% or more homologous to each other typically remains hybridized to each other. Such 
stringent conditions are known to those skilled in the art and can be found in Current Protocols in Molecular Biology, 
John Wiley & Sons, N.Y. (1989), 6.3.1-6.3.6. One example of stringent hybridization conditions are hybridization in 6X 
20 sodium chloride/sodium citrate (SSC) at about 45C, followed by one or more washes in 0.2 X SSC, 0.1% SDS at 50- 
65C. 

[0075] The nucleic acid molecules of the present invention are useful for probes, primers, chemical intermediates, 
and in biological assays. The nucleic acid molecules are useful as a hybridization probe for cDNA and genomic DNA to 
isolate full-length cDNA and genomic clones encoding the peptide described herein and to isolate cDNA and genomic 
25 clones that correspond to variants (alleles, orthologs, etc.) producing the same or related peptides described herein. 
[0076] The nucleic acid molecules are also useful as primers for PCR to amplify any given region of a nucleic acid 
molecule and are useful to synthesize antisense molecules of desired length and sequence. 

[0077] The nucleic acid molecules are also useful for constructing recombinant vectors. Such vectors include 
expression vectors that express a portion of; or all of, the peptide sequences. Vectors also include insertion vectors, 

30 used to integrate into another nucleic acid molecule sequence, such as into the cellular genome, to alter in situ expres- 
sion of a gene and/or gene product. For example, an endogenous coding sequence can be replaced via homologous 
recombination with all or part of the coding region containing one or more specifically introduced mutations. 
[0078] The nucleic acid molecules are also useful for expressing antigenic portions of the proteins. 
[0079] The nucleic acid molecules are also useful as probes for determining the chromosomal positions of the 

35 nucleic acid molecules by means of in situ hybridization methods. 

[0080] The nucleic acid molecules are also useful for designing ribozymes corresponding to all, or a part, of the 
mRNA produced from the nucleic acid molecules described herein. 

[0081] The nucleic acid molecules are also useful for constructing host cells expressing a part, or all, of the nucleic 
acid molecules and peptides. 

40 [0082] The nucleic acid molecules are also useful for constructing transgenic animals expressing all, or a part, of 
the nucleic acid molecules and peptides. 

[0083] The nucleic acid molecules are also useful for making vectors that express part, or all, of the peptides. 
[0084] The nucleic acid molecules are also useful as hybridization probes for determining the presence, level, form 
and distribution of nucleic acid expression. Accordingly, the probes can be used to detect the presence of; or to deter- 

45 mine levels of, a specific nucleic acid molecule in cells, tissues, and in organisms. The nucleic acid whose level is deter- 
mined can be DNA or RNA. Accordingly, probes corresponding to the peptides described herein can be used to assess 
expression and/or gene copy number in a given cell, tissue, or organism. These uses are relevant for diagnosis of dis- 
orders involving an increase or decrease in kinase protein expression relative to normal results. 
[0085] In vitro techniques for detection of mRNA include Northern hybridizations and in situ hybridizations. In vitro 

so techniques for detecting DNA includes Southern hybridizations and in situ hybridization. 

[0086] Probes can be used as a part of a diagnostic test kit for identifying cells or tissues that express a kinase pro- 
tein, such as by measuring a level of a receptor-encoding nucleic acid in a sample of cells from a subject e.g., mRNA 
or genomic DNA, or determining if a receptor gene has been mutated. 

55 C. Vectors and Host Cells 

[0087] The invention also provides vectors containing the nucleic acid molecules described herein. The term "vec- 
tor" refers to a vehicle, preferably a nucleic acid molecule, that can transport the nucleic acid molecules. When the vec- 
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tor is a nucleic acid molecule, the nucleic acid molecules are covalently linked to the vector nucleic acid. With this 
aspect of the invention, the vector includes a plasmid, single or double stranded phage, a single or double stranded 
RNA or DNA viral vector, or artificial chromosome, such as a BAC, PAC, YAC, OR MAC. Various expression vectors can 
be used to express polynucleotide encoding the active hChkl kinase. 
5 [0088] A vector can be maintained in the host cell as an extrachromosomal element where it replicates and pro- 
duces additional copies of the nucleic acid molecules. Alternatively, the vector may integrate into the host cell genome 
and produce additional copies of the nucleic acid molecules when the host cell replicates. 

[0089] The invention provides vectors for the maintenance (cloning vectors) or vectors for expression (expression 
vectors) of the nucleic acid molecules. The vectors can function in prokaryotic or eukaryotic cells or in both (shuttle vec- 
w tors). 

[0090] Expression vectors contain cis-acting regulatory regions that are operably linked in the vector to the nucleic 
acid molecules such that transcription of the nucleic acid molecules is allowed in a host cell. The nucleic acid molecules 
can be introduced into the host cell with a separate nucleic acid molecule capable of affecting transcription. Thus, the 
second nucleic acid molecule may provide a trans-acting factor interacting with the cis- regulatory control region to allow 

rs transcription of the nucleic acid molecules from the vector. Alternatively, a trans-acting factor may be supplied by the 
host cell. Finally, a trans-acting factor can be produced from the vector itself. It is understood, however, that in some 
embodiments, transcription and/or translation of the nucleic acid molecules can occur in a cell-free system. 
[0091] The regulatory sequence to which the nucleic acid molecules described herein can be operably linked 
include promoters for directing mRNA transcription. These include, but are not limited to, the left promoter from bacte- 

20 riophage A., the lac, TRR and TAC promoters from E. co//, the early and late promoters from SV40, the CMV immediate 
early promoter, the adenovirus early and late promoters, and retrovirus long-terminal repeats. 
[0092] In addition to control regions that promote transcription, expression vectors may also include regions that 
modulate transcription, such as repressor binding sites and enhancers. Examples include the SV40 enhancer, the 
cytomegalovirus immediate early enhancer, polyoma enhancer, adenovirus enhancers, and retrovirus LTR enhancers. 

25 [0093] In addition to containing sites for transcription initiation and control, expression vectors can also contain 
sequences necessary for transcription termination and, in the transcribed region a ribosome binding site for translation. 
Other regulatory control elements for expression include initiation and termination codons as well as polyadenylation 
signals. The person of ordinary skill in the art would be aware of the numerous regulatory sequences that are useful in 
expression vectors. Such regulatory sequences are described, for example, in Sambrook et al., (Molecular Cloning: A 

30 Laboratory Manual. 2nded. t Cold Spring Harbor Laboratory Press, Cold Spring Harbor, NY, (1989)). 

[0094] A variety of expression vectors can be used to express a nucleic acid molecule. Such vectors include chro- 
mosomal, episomal, and virus-derived vectors, for example vectors derived from bacterial plasmids, from bacteri- 
ophage, from yeast episomes, from yeast chromosomal elements, including yeast artificial chromosomes, from viruses 
such as baculoviruses, papovaviruses such as SV40, Vaccinia viruses, adenoviruses, poxviruses, pseudorabies 

35 viruses, and retroviruses. Vectors may also be derived from combinations of these sources such as those derived from 
plasmid and bacteriophage genetic elements, eg. cosmids and phagemids. Appropriate cloning and expression vectors 
for prokaryotic and eukaryotic hosts are described in Sambrook et al., Molecular Cloning: A Laboratory Manual. 2nd 
ed., Cold Spring Harbor Laboratory Press, Cold Spring Harbor, NY, (1989). 

[0095] The regulatory sequence may provide constitutive expression in one or more host cells (i.e. tissue specific) 
40 or may provide for inducible expression in one or more cell types such as by temperature, nutrient additive, or exoge- 
nous factor such as a hormone or other ligand. A variety of vectors providing for constitutive and inducible expression 
in prokaryotic and eukaryotic hosts are known to those of ordinary skill in the art. 

[0096] The nucleic acid molecules can be inserted into the vector nucleic acid by well-known methodology. Gener- 
ally, the DNA sequence that will ultimately be expressed is joined to an expression vector by cleaving the DNA 

45 sequence and the expression vector with one or more restriction enzymes and then ligating the fragments together. 
Procedures for restriction enzyme digestion and ligation are known to those of ordinary skill in the art. 
[0097] The vector containing the appropriate nucleic acid molecule can be introduced into an appropriate host cell 
for propagation or expression using well-known techniques. Bacterial cells include, but are not limited to, E. coli, Strep- 
tomyces, and Salmonella typhimurium. Eukaryotic cells include, but are not limited to, yeast, insect cells such as 

so Drosophila, animal cells such as COS and CHO cells, and plant cells. 

[0098] As described herein, it may be desirable to express a peptide of the present invention as a fusion protein. 
Accordingly, the invention provides fusion vectors that allow for the production of such peptides. Fusion vectors can 
increase the expression of a recombinant protein, increase the solubility of the recombinant protein, and aid in the puri- 
fication of the protein by acting for example as a ligand for affinity purification. A proteolytic cleavage site may be intro- 

55 duced at the junction of the fusion moiety so that the desired peptide can ultimately be separated from the fusion moiety. 
Proteolytic enzymes include, but are not limited to, factor Xa, thrombin, and enterokinase. Typical fusion expression 
vectors include pGEX (Smith et al., Gene 67:31-40 (1988)), pMAL (New England Biolabs, Beverly, MA) and pRIT5 
(Pharmacia, Piscataway, NJ) which fuse glutathione S-transferase (GST), maltose E binding protein, or protein A, 
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respectively, to the target recombinant protein. Examples of suitable inducible non-fusion £. coli expression vectors 
include pTrc (Amann etal, Gene 69:301-315 (1988)) and pET 11 d (Studier elaL, Gene Expression Technology: Meth- 
ods in Enzymology 785:60-89 (1990)). 

[0099] Recombinant protein expression can be maximized in a host bacteria by providing a genetic background 
5 wherein the host cell has an impaired capacity to proteolytically cleave the recombinant protein. (Gottesman, S., Gene 
Expression Technology: Methods in Enzymology 185, Academic Press, San Diego, California (1990) 1 19-128). Alter- 
natively, the sequence of the nucleic acid molecule of interest can be altered to provide preferential codon usage for a 
specific host cell, for example E. coli. (Wada et a\. t Nucleic Acids Res. 20:2\ 11-2118 (1 992)). 
[0100] The nucleic acid molecules can also be expressed by expression vectors that are operative in yeast Exam- 
w pies of vectors for expression in yeast e.g., S. cerevisiae include pYepSed (Baldari, era/., EMBO J. 6:229-234 (1 987)), 
pMFa (Kurjan era/., Cell 30:933-943(1 982)), pJRY88 (Schultz etal., Gene 54:1 13-123 (1987)), andpYES2 (Invitrogen 
Corporation, San Diego, CA). 

[0101] The nucleic acid molecules can also be expressed in insect cells using, for example, baculovirus expression 
vectors. Baculovirus vectors available for expression of proteins in cultured insect cells (e.g., Sf 9 cells) include the pAc 
75 series (Smith era/., Mol. CellBiol 3:2156-2165 (1983)) and the pVL series (Lucklow era/., Virology 770:31-39 (1989)). 
[0102] In certain embodiments of the invention, the nucleic acid molecules described herein are expressed in mam- 
malian cells using mammalian expression vectors. Examples of mammalian expression vectors include pCDM8 (Seed, 
B. Nature 329:840(1987)) and pMT2PC (Kanfman eta!., EMBO J. 6:187-195 (1987)). 

[0103] The expression vectors listed herein are provided by way of example only of the well-known vectors available 
20 to those of ordinary skill in the art that would be useful to express the nucleic acid molecules. Preferred vectors include 
the pET28a (Novagen, Madison, Wi), pAcSG2 (Pharmingen, San Diego, CA), and pFastBac (Life Technologies, Gaith- 
ersburg. MD). The person of ordinary skill in the art would be aware of other vectors suitable for maintenance propaga- 
tion or expression of the nucleic acid molecules described herein. These are found for example in Sambrook, J., Fritsh, 
E. R, and Maniatis, T. Molecular Cloning: A Laboratory Manual. 2nd, ed, Cold Spring Harbor Laboratory, Cold Spring 
25 Harbor Laboratory Press, Cold Spring Harbor, NY, 1989. 

[0104] The invention also encompasses vectors in which the nucleic acid sequences described herein are cloned 
into the vector in reverse orientation, but operably linked to a regulatory sequence that permits transcription of anti- 
sense RNA. Thus, an antisense transcript can be produced to all, or to a portion, of the nucleic acid molecule 
sequences described herein, including both coding and non-coding regions. Expression of this antisense RNA is sub- 
30 ject to each of the parameters described above in relation to expression of the sense RNA (regulatory sequences, con- 
stitutive or inducible expression, tissue-specific expression). 

[0105] The invention also relates to recombinant host cells containing the vectors described herein. Host cells 
therefore include prokaryotic cells, lower eukaryotic cells such as yeast, other eukaryotic cells such as insect cells, and 
higher eukaryotic cells such as mammalian cells. Preferred host cells of the instant invention include E. coli and Sf9. 

35 [0106] The recombinant host cells are prepared by introducing the vector constructs described herein into the ceils 
by techniques readily available to the person of ordinary skill in the art. These include, but are not limited to, calcium 
phosphate transfection, DEAE-dextran-mediated transfection, cationic lipid-mediated transfection, e I ectropo ration, 
transduction, infection, lipofection, and other techniques such as those found in Sambrook, etal. (Molecular Cloning: A 
Laboratory Manual. 2nd, ed, Cold Spring Harbor Laboratory, Cold Spring Harbor Laboratory Press, Cold Spring Har- 

40 bor, NY, 1989). 

[0107] Host cells can contain more than one vector. Thus, different nucleotide sequences can be introduced on dif- 
ferent vectors of the same cell. Similarly, the nucleic acid molecules can be introduced either alone or with other nucleic 
acid molecules that are not related to the nucleic acid molecules such as those providing trans-acting factors for expres- 
sion vectors. When more than one vector is introduced into a cell, the vectors can be introduced independently, co-intro- 
45 duced or joined to the nucleic acid molecule vector. 

[0108] In the case of bacteriophage and viral vectors, these can be introduced into cells as packaged or encapsu- 
lated virus by standard procedures for infection and transduction. Viral vectors can be replication -competent or replica- 
tion-defective. In the case in which viral replication is defective, replication will occur in host cells providing functions that 
complement the defects. 

so [0109] Vectors generally include selectable markers that enable the selection of the subpopulation of cells that con- 
tain the recombinant vector constructs. The marker can be contained in the same vector that contains the nucleic acid 
molecules described herein or may be on a separate vector. Markers include tetracycline or ampiciliin-resistance genes 
for prokaryotic host cells and dihydrofolate reductase or neomycin resistance for eukaryotic host cells. However, any 
marker that provides selection for a phenotypic trait will be effective. 

55 [0110] While the active protein kinases can be produced in bacteria, yeast, mammalian cells, and other cells under 
the control of the appropriate regulatory sequences, cell- free transcription and translation systems can also be used to 
produce these proteins using RNA derived from the DNA constructs described herein. 

[0111] Where secretion of the peptide is desired, appropriate secretion signals are incorporated into the vector. The 
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signal sequence can be endogenous to the peptides or heterologous to these peptides. 

[0112] It is also understood that depending upon the host cell in recombinant production of the peptides described 
herein, the peptides can have various glycosylation patterns, depending upon the cell, or maybe non-glycosylated as 
when produced in bacteria In addition, the peptides may include an initial modified methionine in some cases as a result 

5 of a host-mediated process. 

[0113] The recombinant host cells expressing the peptides described herein have a variety of uses. First, the cells 
are useful for producing a kinase protein or peptide that can be further purified to produce desired amounts of kinase 
protein or fragments. Thus, host cells containing expression vectors are useful for peptide production. 
[0114] Host cells are also useful for conducting cell-based assays involving the kinase protein or kinase protein 

w fragments. Thus, a recombinant host cell expressing a native kinase protein is useful for assaying compounds that stim- 
ulate or inhibit kinase protein function. 

[0115] Host cells are also useful for identifying kinase protein mutants in which these functions are affected. If the 
mutants naturally occur and give rise to a pathology, host cells containing the mutations are useful to assay compounds 
that have a desired effect on the mutant kinase protein (for example, stimulating or inhibiting function) which may not be 
15 indicated by their effect on the native kinase protein. 

[0116] The following examples are provided for illustration purposes. 

Examples 

20 1 . Identification of the Catalytic Domain Sequence 

[0117] From the complete protein sequence for the human checkpoint effector kinase (Chk1, 476 residues) availa- 
ble through GenBank, using sequence alignment and structures for other kinases, a homology model was devised for 
the kinase domain of the Chk1 protein (See Figure 3). 

25 [0118] All protein kinases utilize ATP to phosphorylate their substrates, involving the transfer of a gamma phos- 
phate to a substrate hydroxyl group. Each kinase binds ATP with its own strength, a property that is correlated by meas- 
uring K/IC50. The ATP molecule consists of adenine, ribose and triphosphate moieties. Each of these moieties bind to 
the protein in the ATP binding site (or ATP pokket). The adenine moiety always binds to the protein backbone by forma- 
tion of two or three hydrogen bonds. The ribose moiety forms one to two hydrogen bonds with the protein side chains 

30 of amino acids that lay outside of the ATP pocket. The tri-phosphate moiety interacts with those catalytic amino acids 
of the kinase that are generally consistent across the whole protein kinase family. There is a limited specificity for each 
kinase within ATP binding groove. This region is referred to as the specificity pocket. Using the homology model, a sche- 
matic of the Chk1 binding site was developed, identifying the ATP binding site, the donor-acceptor-donor binding motif 
and the specificity pocket (See Figure 9). This binding site is the target for inhibitor development, e.g. the development 

35 of compounds or molecules that bind to this site to the extent that the kinase activity of the Chk1 protein is blocked or 
inhibited. The black and red color in Figure 9 represents the ATP binding groove; note, Ser 147 can contribute to the 
binding of inhibitor. The area designated by the blue color represents the region outside of the ATP pocket that can be 
used for enhancement of the specificity of binding. Finally, the area in pink represents the 'specificity pocket, that region 
that is very different from one protein to another. This site does not contribute to the ATP binding but can be used for 

40 the design of specific inhibitors. In other words, by utilizing that region of the Chk 1 binding site that is unique to Chk1 
(the specificity pocket), one may design compounds that specifically inhibit Chk1 without also inhibiting the various 
other kinase molecules that may not be targets of the inhibition therapy. 

[0119] Analysis of the C-termini of the kinase suggested that amino acids beyond residue 265 would enhance high 
level expression and/or maintain the appropriate crystal structure. The homology model showed this region to be flexi- 
45 ble, such that ending the kinase domain construct within this region can prevent the disruption of potential secondary 
structures. Specifically, cleaving the Chk1 protein anywhere between amino acid residues 263 and 265 would result in 
the destruction of helical interactions at the distal end. The homology model further predicted that the kinase segment 
should extend to at least residue 272 to 275 and may be further extended to residue 289-291 . 
[0120] In addition, including the extended region in the construct prevents the C-terminal histidine tag from interact- 
so ing with the kinase domain, making it accessible for affinity chromatography. Based on these analyses, construct 
KH289 was designed for the expression of Chk1 kinase domain of residue 1-289 with 6xHis-tag at its C-terminus. A cor- 
responding construct without the 6xHis-tag was also made. Two other constructs were designed based on the homol- 
ogy model: (1) kinase domain of residues 1-210 (KH210) and (2) kinase domain of residues 1-248 (KH248). 

55 2. Cloning 

[0121] Human Chk1 cDNA was cloned by PCR using Vent polymerase (New England Biolabs, Beverly, MA) from 
human thymus and testis Marathon -Ready cDNA (Clontech, Palo Alto, CA) with primers synthesized (Genset, LaJolla, 
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A) based on the published sequence [SEQ ID NO. 1] (GenBank Accession number AF1016582) [Sanchez, Science 
(1997), supra .] r following the instruction from the venders. Two overlapping sequences were amplified independently, 
one contained the sequence of nucleotides 35-830 of SEQ ID NO.1, and the other contained the sequence of nucle- 
otides 678-1480 of SEQ ID NO.1. These overlapping sequences cover the whole coding sequence of Chk1 plus 16 

5 base-pairs (bps) of 3'- untranslational region. The cDNA of 35-830 encodes the kinase domain of residues 1 -265. 
[0122] The PCR oligonucleotide primer sequences are listed in Table 1 . Restriction sites for cloning, codons for 
6xHis-tag, and the stop codon were engineered in the PCR primers. Restriction site Stul preceded Ncol site which over- 
lapped the initiation codon. Sacl site followed the stop codon. When included, codons for 6xHis-tag preceded the stop 
codon, so that an expressed protein would have a 6xHis-tag at its C-terminus. 

10 [0123] The amplified cDNA was cloned into expression cassette pCR-TOPO (plasmid from Invitrogen, Carlsbad, 
CA) following the vender's instruction and the sequences were verified by sequencing of both strands (Retrogen, San 
Diego, CA). The amplified cDNA sequence was identical to the sequence deposited in GenBank referenced above. The 
full-length Chk1 cDNA was constructed from these two overlapping cDNAs, ligating through the Clal restriction site at 
734-739. This full-length cDNA was used as PCR template to generate cDNA fragments for expression or directly to 

15 generate the full-length Chk1 expression vector. All the PCR products were cloned into pCR-TOPO for sequencing. 
Constructs were made for the expression of full-length Chk1 and various lengths of kinase domain with or without 
6xHis-tag. 



Table 1 



25 



35 



PCR Primers* 


Primer 


Sequence 


SEQ ID NO. 


chk6w 


GAG CTC AGT ACC ATC TAT CTT TTT TGA TGT CTG G 


3 


KH28 


GAG CTC AGT TGG TGG TGG TGG TGG TGT CCA CTG GGA GAC TCT 


4 


9 


GAC AC 




K289 


GAG CTC ATC CAC TGG GAG ACT CTG ACA C 


5 


Chk11 


CCA TGG AGC TCA AGA AAG GGG CAA AAA GG 


6 


K210 


GAG CTC ATT GGT CCC ATG GCA ATT CTC C 


7 


KH21 


GAG CTC AGT GGT GGT GGT GGT GGT GGT GGT CCC ATG GCA ATT 


8 


0 


CTCC 




K248 


GAG CTC ACT CAA CTA AGA TTT TAT GCA GCA G 


9 


KH24 


GAG CTC AGT GGT GGT GGT GGT GGT GCT CAA CTA AGA TTT TAT 


10 


8 ! 


GCA GCA G 





40 

3. Chk1 Antibodies 

[0124] Peptide NRVTEEAVAVKIVDMKRAVD (residues 28-47 of SEQ ID NO. 2) was selected for generating anti- 
body against N-terminus of human Chk1. Peptide DDKILVDFRLSKGDGLE (residues 434-450 of SEQ ID NO. 2) was 
45 selected for generating antibody against C-terminus of human Chk1. Rabbit polyclonal antibodies were ordered 
through the Custom Antibody Production Services from Research Genetics, Inc. (Huntsville, AL). Both antibodies 
detected recombinant or endogenous human Chk1 as expected. 

4. Fermentation 

50 

[0125] The overall scheme was follows. The 3' PCR primers were engineered to encode both untagged and tagged 
(with 6-histidine tag) proteins. The segment of cDNA for 1 -289 was cloned into a pFastBac plasmid (obtained from Life 
Technologies) and an Ndel site was introduced. A recombinant baculovirus was generated using the Bacmid system 
(obtained from Life Technologies). The protein (KH289) was expressed in Hi-5 insect cells and purified by a combina- 
55 tion of ion-exchange and affinity chromatography. The segments of cDNA for the full-length Chk1 (1-476AA) and the 
Chkl kinase domain (1-265AA) were cloned into pAcSG2 plasmid and recombinant baculovirus was generated using 
BaculoGold viral DNA (obtained from Invitrogen) and a modified CellFectin transfection (obtained from Life Technolo- 
gies) and plaque selection (obtained from Novagen) protocol. The expressed protein was purified using the chromatog- 
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raphy scheme described below. High salt concentration in buffers was found to be required to prevent precipitation of 
the purified proteins. The details of the protocol are discussed below. 

Generation of Expre ssion Plasmids 

5 

[0126] Plasrrtid pFastBac-Nde was modified from the pFastBad vector (Life Technologies, Gaithersburg, MD) by 
in vitro site-directed mutagenesis using the Muta-Gene in vitro Mutagenesis Kit (Bio-Rad, Hercules, CA) following the 
supplier's instruction. Two nucleotides were substituted in pFastBad using the following oligonucleotide: 

w TGA ATA ATC CGG CAT ATG TAT AGG TTT TTT [SEQ ID NO. 14] 

This created a unique Ndel site at the original translation start site for the polyhedrin protein. 

[0127] The amplified cDNA fragments were digested with the restriction enzyme Stul and Sacl and cloned to plas- 
mids pET28a (Novagen, Madison, Wl), pAcSG2 (Pharmingen, San Diego, CA), or pFastBac-Nde. The pET28a vector 

75 was used for protein expression in E.coli and pAcSG2 and pFast-Bac-Nde were used for protein expression in insect 
cells. To clone the cDNA fragments encoding Chk1 kinase domain with amino acids 1-289 (construct KH289) into the 
pFastBac-Nde, the cDNA fragment was excised from the pCR-TOPO plasmid with restriction enzymes Stul and Sacl, 
ligated between the blunt-ended Ndel site and Sacl site. Plasmids with correct insertion were analyzed by restriction 
enzyme digestion. The full-length Chk1 and the kinase domain of residues 1-265 (KH265) with or without C-terminal 

20 6xHis-tag were cloned into pAcSG2 using the restriction sites of Stul and Sacl. Expression vectors for kinase domain 
of residues 1-210 (KH210) and kinase domain of residues of 1-248 (KH248) were made in pFastBac-Nde. 
[0128] Expression in E.coli was done following the instructions supplied with the pET28a vector. Proteins 
expressed in the form of full-length Chk1 or kinase domain of residues 1 -265 or kinase domain of residues 1 -289 were 
in the insoluble fraction when analyzed by ReadyPreps Protein Preparation Kit (Epicentre Technologies, Madison, Wl). 

25 

Generation of Recombinant Viruses 

[0129] The Bac-to-Bac system (Life Technologies) was used to generate recombinant baculovirus for expression of 
the C-terminally 6xHis-tagged Chk1 kinase domain (amino acids 1-289, KH289) as instructed. Recombinant viruses 
30 were confirmed by PCR for the presence of Chk1 cDNA insertion. Protein expression was confirmed by SDS-PAGE or 
Western blot with the Chk1 polyclonal antibodies. The expression of KH289 appeared to be the highest among all the 
constructs. High titer stocks of recombinant viruses were generated by 2 to 3 rounds of amplification using Sf21 insect 
cells. 

[0130] Recombinant viruses for expression of the full-length Chk1 and kinase domain of residues 1 -265 were gen- 
35 erated by co-transfection of Sf21 cells with pAcSG2 vector and BaculoGold (PharMingen, San Diego, CA). 

Expression in Insect Cells 

[0131] The yield of active soluble protein obtained in the E. coli fermentation described above was impractical for 

40 large-scale experimentation. Therefore, an alternate fermentation system was developed. Insect cells Sf9 for viral 
amplification, and Hi-5 cells for protein production (both from Invitrogen, Carlsbad, CA, USA) were adapted to grow in 
insect medium contained 1% Fetal Bovine Serum (Life Technologies, Grand Island, NY, USA). Cells were propagated 
and maintained in suspension culture at 27°C in either Erlenmeyer shake flask (Corning # 4301 83) or in an upright roller 
bottle (Corning Inc., Corning, NY, USA # 25290-17000) with a loosened cap for aeration. The flasks were placed in a 

45 reciprocal refrigerated shaker (Innova 4343, New Brunswick Scientific, Edison, NJ, USA) at 120 rpm. The cell density 
was maintained at between 5 X 1 0 5 to 2 X 1 0 6 cells/ml by diluting the cultured cell suspension with a fresh pre-warmed 
(27°C) medium. The viability of insect cells was maintained at 98%. The viability of insect cells were determined by 
microscopic count of total stained cells by trypan blue versus the total cell number in a hemocytometer 
[0132] Sf9 insect cells were used for amplification for recombinant virus stock. The recombinant baculovirus from 

so a single plaque was pick up by a pipette tip and added to Sf9 cells monolayer in T-25 flask (Becton Dickinson Labware, 
Franklin Lakes, NJ, USA) with 10 ml medium SF900II and 1% of Fetal bovine Serum (Life Technologies, Grand Island, 
NY, USA) and incubated at 27°C. After 6 days, the culture supernatant was used as first generation of virus stock (P1 ) 
for further amplification of P2 and P3 virus stocks to 2-3 L. For large scale amplification of the P2 and P3 virus stock, 
P1 or P2 virus stock was added to Sf9 cells at a cell density of 1 X 10 6 cells/ml, the infection was carried out with Mul- 

55 tiplicity Of Infection (MOI) of 0.1 , cells were grown in suspension in 500ml of SF900II in 2 L roller bottle (Corning Inc., 
Corning, NY, USA) standing up right in a shaker incubator at 1 20 rpm at 27 ° C for 6 days. This process was repeated 
until 2-3 L viral stock (P3) were obtained. The titer of this virus stock was 1 to 5X10 8 p.f.u/mL. The viral titration was 
determined by the plaque assay method, with serial 10-fold dilution up to 10 8 fold. The viral stock was stored at 10°C, 
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and used for large scale protein production within 2 months to avoid viral instability. 

[0133] The Hi-5 insect cells (derived from Trichopiusia.ni cells ) which have been adapted to grow in medium Ex-cell 
401 (JRH Biosciences, Lenexa, KS, USA) with 1% Fetal Bovine Serium were used for protein production. The cells 
were grown in the upright roller bottle up to cell density at 2 X 1 0 6 cells/ml; and were used as seed cells for bioreactor 

5 culture. The cells were grown in a 20 L stirred bioreactor with working volume at 18L (Applikon Inc., Foster City, CA, 
USA). Air flow rate was operated at about 1 0 ml per min per liter culture fluid. The air was fortified by pure oxygen in 
order to maintain the Dissolve Oxygen (D0 2 ) at 50% of air saturation. The agitation was maintained at 200 rpm through- 
out the cultivation. Cell density was started at about 5 X 10 5 cells/ml and cells were infected when the density reached 
2 x 1 0 6 cells/ml. The MOI was 3 and the infection was carried out for 48 Hrs. After 48 hrs. of infection, the infected cells 

w were harvested by centrifugation at 3,000 rpm for 10 min, at 4°C by a refrigerated centrifuge (model PR-7000M, IEC, 
Needham Heights, MA, USA). The cell pellets were collected and stored at -80°C. 

5. Purification 

75 6X-His tagged KH289 

[0134] The basic purification scheme is depicted in Figure 4. Frozen cell pellets were thawed, suspended in ice- 
cold lysis buffer, and lyzed by microfluidizer (Microfluidics Corporation, Newton, MA). The lysis buffer contained 25 mM 
Tris-HCI, pH 8.0, 500 mM NaCI, 20 mM imidazole, and 14 mM _ -mercaptoethanol. The lysate was centrifuged for 40 

20 minutes at 40,000 rpm in a Ti45 rotor in Beckman L8-70M ultracentrifuge. The soluble fraction was flowed through a 
150 ml_ Q-Sepharose FastFlow anion exchange column (Pharmacia, Piscataway, NJ), then loaded onto a 40ml Ni-NTA 
agarose column (Qiagen, Santa Clarita, CA). After extensive washes with the lysis buffer, the column was eluted with 
240 ml of 20 mM to 300 mM imidazole gradient in the lysis buffer. Fractions containing the Chk1 kinase domain (KH289) 
were identified by SDS-PAGE and pooled. The pooled fractions were dialyzed in 25 mM Tris-HCI, pH 7.5, 500 mM NaCI, 

25 0.5 mM EDTA, and 5mM DTT overnight. The dialyzed pool was diluted with 1 .5 volumes of 25 mM Tris-HCI, pH 7.5, 20 
mM MgCI 2 , 8% glycerol, 5 mM DTT and loaded immediately onto a 40 ml ATP-Sepharose column. The column was 
eluted with 200 ml of 25 mM Tris-HCI, pH 7.5, 500 mM NaCI, 5 mM DTT, and 5% glycerol. Fractions containing KH289 
were pooled and concentrated in a Millipore Stirred Cell under 60 psi N 2 and loaded onto a 320 ml HiPrep Sephacryl 
gel-filtration column and eluted with the same buffer. Pooled fractions were concentrated to 7-7.5 mg/ml for crystallog- 

30 raphy or ~3 mg/ml for HTS. Protein was flash-frozen in liquid N 2 and stored at -80°C. 

[0135] Maintaining salt concentration around 500 mM NaCI including 5% glycerol was found to be crucial for pre- 
venting aggregation of Chk1 proteins during purification and storage without affecting the intended use. 

6X-His tagged KH265 and KH47P ChK1 

35 

[0136] Essentially the same methods were used to purify the full-length Chk1 and the kinase domain of residues 1- 
265 expressed in insect cells. The expression protein levels as measured after the Ni-NTA chromatography or the final 
yields were much lower than that of the KH289 (full length sequence). 

[0137] Gel-filtration HPLC has been used as a means of quality control. No significant difference was observed for 
40 samples stored at room temperature, 4°C, or -80°C for 4 days. The material eluted at a void volume that was less than 
0. 1%. 

6. Crystallization. Crystallography and Three-Dimensional Analysis 

45 [0138] The full length Chk1 protein (1 -476 AA) had proven to be difficult to crystallize until the active kinase domain 
(1 -289 AA) was identified. This active kinase was able to be expressed at the high concentration required for use in HTS 
and crystallography. The Chk1 data set was collected on MarlP345 under cryotemperature with stream freeze. The 
HB2-092 kinase domain preparation (1-289 AA) was first used. The initial data set at 2.35 Q was obtained with overall 
Rsym of 4.6% and overall mosaicity for the data set is 1 .2. Subsequent experiments with the HB2-101 (also a 1-289 

so clone) reached a 1.7 O resolution with mosaicity of 0.38 for the kinase domain using a crystal grown in refined condi- 
tions. Both the original and subsequent crystals have a space group P21 with one molecule per asymmetric unit. The 
results from the crystal lographic analysis are shown in Table 2 below. 
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Table 2: Statistics for the 


erystallographic analysis 




Crystal 


Natl 


Nat2 


AMP-PNP 


Hg 


Au 


Internal merging and scaling 












D f t-/ M 1 1 1 ir\n ( a\ 
ixCSC)! UUOI1 \J\f 


1.7 


2.1 


1.7 


2.4 




ivciinaiuna nicasurco 


162418 


46947 


107449 


64881 


I / 3 / Zo 


Unique reflection 


35032 


19145 


35285 


12821 


22086 


Completeness (%) 


93.6(88.3) 


95.4(94.6) 


94.1(91.1) 


95.4 (96.4) 


97.5 (84.8) 


Average I/a 


29.9 (9.0) 


15.47(4.38) 


26.4(12.5) 


27:1 (11.6) 


33.5(14.8) 


D 1 

K sym 


_> - 1. \ 1 0. 1 ) 




j.u { 1U.UJ 




4.2(118) 


SIRSAS analysis 












Resolution (A) 








15-3.0 


15-3.0 














Phacino nAU/orJ /QTO/O A 0\ 

rnasing power^ piK/oAo) 








o '>'7/l oo 


in/i vie 
2.3V/1.40 


Figure of merit (combined) 










0.764 


Refinement statistics 












Resolution range (A) 


7-1.7 


7-2.1 


7-1.7 






Reflections used 4 (F>loF ) 


30132 


15804 


31794 






Total nonhydrogen atoms 


2372 


2354 


2460 






Rcryst 5 (%) 


21.6 


20.8 


22.6 






Rfree 6 (%) 


23.5 


25.0 


24.9 






rmsd from ideal bond length (A) 0.005 


0.006 


0.010 






rmsd from ideal bond angle (°) 


1.30 


1.27 


1.58 






Average B (A^ ; all atoms) 


28.9 


29.7 


23.22 







Data for the outermost resolution shell are given in parentheses. 
N _ N 

1 Rsym =**II(h)-I(h)il/-l(h)i»100, 

h i=l hurl 

where I(h)i is the ith measurement of reflection h and 1 (b) is the mean value of the N equivalent reflections. 

2 Rcullis = ♦ 1 1 FPH +/- FP I - FH(calc) I / - I FPH +/- FP I for all centric reflections. 

3 Phasing power = r.m.s. ( I FH 1 / E ), where I FH I is the heavy-atom structure factor amplitude and E is the residual 
lack of closure. 



Number of reflections used in working set. 

Rcryst s • I IFobsl - 1 IFcalcl IHFdbsl, where summation is over data used in the refinement 
Rfree is the same calculation including the 10% of data excluded from all refinements. 
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[0139] Crystals were grown at 13°C using a hanging-drop vapor-diffusion method. Two crystallization conditions 
produced the exact same form of crystals. The Natl crystal was obtained by mixing equal volume of protein solution (7- 
to 7.5 mg/ml protein) and reservoir solution of 13% PEG 8000 (w/v), 0.1 15 M (NH 4 ) 2 S0 4 0.1 M NaCacodylate (pH 6.8), 

45 2% glycerol. The Nat2 crystal was crystallized using reservoir solution of 12% PEG8000 (w/v), 15% isopropanol, 0.1 M 
Hepes (pH 7.5). The crystals belong to the space group P2 1 and have unit cell dimensions a = 45.2A, b = 65.7A, c = 
58.1 A, d = 93.9°. The crystals contained one molecule per asymmetric unit and are 53% solvent by volume. The crys- 
tals of binary complex with AMP-PNP were obtained by co-crystallization first under the same crystallization condition 
as Natl crystal in the presence of 1 .25 mM AMP-PNP and 2.5 mM MgCI 2 , then the resulting crystals were soaked in 

50 mother liquor containing 5 mM MgCI 2 and 20 mM AMP-PNP fortwo days. The co-crystals had the identical space group 
(P2 1 ) and cell dimensions as the native crystals. All diffraction data were collected at -170°C. Crystals were introduced 
into cryoprotectant solution containing its reservoir solution and 20% glycerol. For AMP-PNP co-crystal, additional 10 
mM MgCI 2 and AMP-PNP were included in cryoprotectant solution. Crystals were then flash frozen in a stream of nitro- 
gen gas -170°C. All data collection was carried out with home source using CuK y-radiation produced by a Rigalu rota- 

55 tion anode FR5 X-ray generator equipped with focusing mirrors and measured with a Mar 345 image-plate detector. All 
data were processed with the Denzo/HKL package (Otwinowski, Z., "Oscillation Data Reduction Program", Proceed- 
ings of the CCP4 Study Weekend: Data Collection and Processing, pp. 56—62, compiled by: L. Sawyer, et al., SERC 
Daresbury Laboratory, England (January 29-30, 1993)). 
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[0140] Initial apoenzyme structure determination using Natl crystal data was carried out by molecular replacement 
(MR) using modified Cdk2 structure (omitted loop regions) (Russo, AA et al„ Nature 382(6589) :325-31 (Jul 25, 1996)) 
as a search model. Rotation and translation functions using the AMoRe software (Navaza J, Acta Crystallographic, 
50(2): Section A (March, 1 994)) revealed a solution using Natl data from 1 0 to 4 A. The MR model was refined by sim- 

5 ulated annealing (X-plor). However, after successive rounds of rebuilding and refinement, 2Fo-Fc and Fo-Fc electron 
density maps were poorly defined at the loop regions which were omitted from the initial model. To obtain additional 
phase information, multiple isomorphous replacement was carried out with two heavy metal derivatives: 0.5 mM HgCI 2 
(soaked for 1 5 hrs) and 5 mM Kau (CN) 2 (soaked for 1 7 hrs). Five Hg sites and five Au sites were identified by difference 
Fourier synthesis using phases generated from the MR partial model and were consistent with both isomorphous and 

10 anomalous difference Patterson maps. The positional and thermal parameters and relative occupancies for the heavy 
atom sites were refined using SIR data at 3 A and anomalous data at 3.5 A by program PHASES (Furey, W et al. 
"Phases: a Package of Computer Programs Designed to Compute Phase Angles for Diffraction Data from Macromo- 
lecular Crystals", American Crystallographic Association, Series 2, 18:73 (1990)). Sixteen cycles of solvent flattening 
were then carried out using phases calculated from refined Hg and Au positions. The resultant electron density maps 

75 showed a good backbone density and well-defined side chains for most part of the protein. Model building utilized the 
program FRODO (Jones, T.A., J Appl Cryst, 11: 268-272 (1978)). The missing loop regions were incorporated into the 
model using both MIR maps and model phased 2Fo-Fc maps. Further refinement in XPLOR (Brunger, AT. et al., X- 
PLOR Version 3.1: A System for X-ray Crystallography and NMR D , Yale University Press, (1992)) and then CNS 
(Brunger, AT. et al, Crystallography & NMR System, Acta Cryst., D54: 905-921 (1998)) were continued with both con- 

20 jugate gradient minimization and simulated annealing, then followed by manually rebuilding. 

[0141] Refinement of Nat2 structure was carried out by using refined Natl model but omitting residues 1 53-1 70 as 
well as S0 4 . Fo-Fc maps showed well defined densities for the omitting region and its conformation is exactly same as 
that in Natl. 

[0142] Refinement of the binary complex with AMP-PNP was proceeded with refining the position of the refined 
25 apo-enzyme model (Natl) as rigid body against the complex data using CNS program. Fo-Fc maps with __A (Read, 
R.J., Acta Cryst., A42: 140-149 (1986)) weighting showed clear density for the adenine and ribose components of 
AMP-PNP. The conformation of residues forming the binding pocket was checked in simulated annealing omit maps 
before including the adenine and ribose components of AMP-PNP. 

[0143] The apo-enzyme model (Natl) included all atoms for residues 2 to 44 and 48 to 276, 183 ordered solvent 
30 molecules and one S0 4 molecule The refined Nat2 structure contained the same number of residues and solvent mol- 
ecules but the S0 4 molecule was not present. The refined AMP-PNP complex contained the same number of residues 
as apo-structures, with 150 ordered solvent molecules and one S0 4 molecule. The triphosphate moiety of AMP-PNP 
was disordered and no Mg 2+ ions were visible. The final model had all residues in "most favored" or "additional allowed" 
regions of the Ramachandran plot according to PROCHECK (Laskowski RA et al., J. Appl. Cryst, 26: 283-291 (1 993)), 
35 with no residues in "generously allowed" or "disallowed" REGIONS, indicating the well refined nature of the identified 
crystal structure. The terms "generously allowed" and "disallowed" are descriptions of the configuration of Phi and Psi 
angles of the protein structure. A well refined protein structure should not place these angles in the unpreferred or non- 
naturally occurring configurations. 

40 7. The Overall Kinase Structure 

[0144] The crystal structures of the kinase domain of human Chk1 and its binary complex with an ATP analog, 
AMP-PNP, have been determined to 1 .7 A resolution. Both structures contain the kinase core domain (residues 2-267) 
and residues in the linker region that connects the N-terminal kinase domain with the C-terminal region of Chk1. The 
45 crystallographic analysis is summarized in Table 2. The Chk1 crystal coordinates for the apoenzyme (isolated active 
Chk1) and the binary complex (Chk1 complexed with AMP-PNP, an ATP analog) are shown in Figures 1 1 A and 1 1 B, 
respectively. The coordinates of the fixed water molecules are also included therein. 

[0145] The kinase domain of human Chk1 has a canonical kinase two-lobe fold, with the ATP binding cleft between 
the two lobes (Figure 5, structure model). The smaller N-terminal lobe contains one helix (aC) and 5 p-strands (P1 to 

so p5) that form a curved anti-parallel p-sheet. The larger C-terminal lobe contains a cluster of 7 helices (<xD to al), packed 
against 6 P strands (p6 to pi 1 ) which border the cleft. One p strand (P6') comprises the hinge region connecting the two 
lobes. In both apo-enzyme and binary structures, the ATP binding site, catalytic residues, and the activation loop are 
well ordered. Comparison with crystal structures of other kinases indicates that the Chk1 kinase domain is closely 
related to PhK (Lowe, ED et al., EMBOJ, 16(22):6646-58 (Nov 17, 1997)) (See Figure 1A, 1B). The N-terminal lobe 

55 (Residues 2-90) superimposes with an r.m.s. derivation for Ca atoms of 1 .1 A, while the C-terminal lobe (Residues 91 - 
276) superimposes with an r.m.s. derivation for Ca atoms of 0.9 A. In the C-terminal lobe, major differences are found 
in helix aG, and the connecting loop between aG and aH. These are not included in the superposition. The Chk1 
apoenzyme adopts a more open conformation compared to PhK. The N-terminal lobe of Chk1 is rotated -15° relative 
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to the ternary complex of PhK with its substrates. Comparison of the AMP-PNP bound Chk1 binary complex with the 
apoenzyme structure shows no conformational change. A high degree of sequence homology for Chk1 kinase domains 
of different species (Figure 2) suggests that there is an overall structural conservation of the kinase domain. Residues 
that are not modeled in the current structures are not conserved in Chk1. For example, there is a six-residue insertion 

5 in the loop connecting 03 and aC in S. pombe Chk1 . 

[0146] The two lobes are held together by an extensive hydrogen-bond network at the lobe interface which involves 
the loop linking aC and 34 of the N-terminal lobe, 06' of the hinge region, and 37 and 38 of the C-terminal lobe. This 
network extends from the back of the protein to the front opening of the ATP binding cleft. Residues involved in this net- 
work also form part of the pocket that interacts with the adenine moiety of AMP-PNP. Strand 38 immediately precedes 

w the kinase conserved DFG motif, in which Asp148 is important for the alignment of the phosphate groups of ATP. The 
only reported mutation in the Chk1 kinase domain is at the lobe interface. Replacement of the conserved Glu85 by Asp 
leads to a temperature-sensitive phenotype in fission yeast in which the mutant maintains cell cycle arrest after UV irra- 
diation but impairs the DNA replication checkpoint at nonpermissive temperature (Francesconi, S et al., EMBO J, 
16(6): 1332-41 (Mar 17, 1997)). The side chain of Glu85 at the end of strand 35 forms hydrogen bonds with the side 

15 chain of conserved Lys145 from strand 38 as well as with the main chain amide of conserved Lys69 that precedes 
strand 34. These interactions, together with the extensive hydrogen-bond network at the lobe interface, appear to play 
an important role in maintaining the correct disposition of the N-terminal lobe and the DFG loop during lobe movement. 
The Glu to Asp mutation, while maintaining similar charge, would not be long enough to form those hydrogen bonds 
provided by Glu85, thereby weakening lobe interactions and rendering the mutant protein less stable at higher temper- 

20 ature. 

[0147] Most of the invariant residues of Chk1 proteins are located in the C-terminal lobe. Many of them are also 
conserved among Ser/Thr kinases and are involved in stabilizing the catalytically active kinase conformation and in 
binding ATP. The positions of several invariant motifs of Chk1 proteins are noteworthy. Compared with other Ser/Thr 
kinases, the IEPDIG motif (residues 96-101) shortens aD to a one-turn helix, since Pro98 initiates a tight turn between 

25 aD and ccE. This turn interacts with the C-terminus of helix ccF through a backbone hydrogen bond between Asp99 and 
the invariant Giy204. In this turn, Glu97 forms backbone hydrogen bonds with Ile100 and Gly101 . The unique confor- 
mation of this motif appears to be important for peptide substrate interaction, since the side chains of Ile96 and Pro98 
form part of a hydrophobic pocket that interact with the peptide substrate as discussed below. Helix ccE contains a con- 
served motif of AQXFFXQL (residues 107-1 14; SEQ ID NO: 24), with the hydrophobic residues buried inside the C-ter- 

30 minal lobe. The side chain of Gin 1 08 projects towards the linker region that follows the kinases core domain and forms 
hydrogen bonds directly or through a water molecule to backbone atoms of Lys267, Leu269 and Lys270. Although 
Chk1 sequences diverge in this linker region, these backbone interactions with Gin 1 08 could still be conserved, holding 
the linker against the N-terminus of ccE. Helix aG is positioned differently compared with aG of PhK. Two sets of invar- 
iant PW residues (207 and 208, 230 and 231) flanking aG, although separated by 21 residues, are in van der Waals 

35 contact and connected to the hydrophobic core of the C-terminal lobe. This stabilizes the surface for peptide substrate 
interaction. 

Activation and Catalytic Loops 

40 [0148] Interesting features of the Chk1 kinase domain include interactions that stabilize the activation loop. The 
structure of the activation loop determines the alignment of residues contacting ATP and performing catalysis in protein 
kinases. Interacting with the catalytic loop, the activation loop orients the catalytic Asp; interacting with the N-terminal 
lobe, the activation loop closes the N and C terminal lobes and aligns residues that interact with the phosphates of ATP. 
The activation loop is defined as the region between the conserved motifs of DFG and APE corresponding to residues 

45 148 to 177 of Chk1 . Conformational changes in the activation loop serve as a major regulatory mechanism for kinase 
activity. In the human Chk1 structures, the activation loop is folded in a conformation similarto those found in structures 
of active kinases, consistent with the observation that the Chk1 kinase domain is constitutively active. This active con- 
formation is stabilized by special features of Chk1 secondary structures and their side chain interactions (Figures 3 
and 5, homology model and crystal structure). 

so [0149] The N-terminus of the activation loop interacts with the catalytic loop through the interaction of 36 and 39. 
Immediately following 39, 310 interacts with pi 1 to form a two-stranded 3-ioop with a turn at Asn159. This 3-loop is 
packed against the N-terminus of the catalytic loop and positions the highly conserved Arg156 and Glu161. The side 
chain of Arg156 interacts with the carbonyl of the invariant His122 at the end of aE. Through the invariant Asp190, the 
side chain of His122 is connected to the amide of Arg129, adjacent to the catalytic residue Asp130. The carboxyl of 

55 Glu1 61 forms a hydrogen bond with the imidazole of His185 that precedes ccF. These interactions anchor this end of the 
activation loop to the core of the C-terminal lobe. The center of the activation loop interacts with the rest of C-terminal 
lobe through two backbone hydrogen bonds between Leu164 and Phe184. The activation loop ends at its C-terminus 
with a turn which is supported by aEF In human Chk1, aEF is anchored at two positions to the core of the C-terminal 
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lobe through two ion-pairs, one is the invariant kinase ion-pair between Glu1 77 and Arg253, another is between Lys1 80 
and Glu248 which is unique to Chk1 . This extra ion-pair constrains the movement of aEF, and in turn the movement of 
the C-terminal end of the activation loop. The pair of Lys180 and Glu248 is only conserved in vertebrate Chk1 , suggest- 
ing potential flexibility of aEF and the activation loop of Chk1 in lower organisms such as S. pombe. 

5 [0150] Crystal structures of kinases indicate that the conformation of the activation loop is influenced by its negative 
charge which neutralizes a cluster of positively charged residues, although the ionic interaction may not be absolutely 
required as in the case of mammalian casein kinase I. The negative charge is provided by phosphate through phospho- 
rylation, carboxyl group of Glu, or solvent ions. In Chk1, the positively charged cluster of Arg129, Arg162, Lys166, and 
Lys54 is present, but no phosphorylation is observed. In both the apoenzyme and binary complex structures deter- 
to mined to 1 .7 A, a sulfate ion was close to the phosphate position of the phosphothreonine (Thr1 97) in PKA. This sulfate 
ion interacts with Arg129, Arg162, and Thr153. Sulfate is present in the crystallization solution and could contribute to 
the stability of the positively charged cluster and the activation loop. To clarify the role of this sulfate ion and to better 
understand the interactions that stabilize the activation loop, crystals were produced under sulfate-free condition and 
determined the structure to 2.1 A (Table 2). This 2.1 A structure is referred as Nat2 structure, whereas the 1 .7 A apoen- 

15 zyme structure is referred as Natl structure. In Nat2 structure, no sulfate ion is present. 

[0151] Superimposition of Natl and Nat2 structures revealed similar conformations for the corresponding activation 
loops except for the side chain of Arg1 62 which turns toward the solvent in Nat2 structure. The side chain of Arg1 62 is 
flexible in both structures as indicated by its high temperature factors. Arg162 is an invariant residue of Chk1 and its 
function is not readily apparent from the structure. In both the Natl and Nat2 structures, the side chain of Arg129 forms 

20 hydrogen bonds to three main chain carbonyl oxygens (Leu1 51 , Ala152, and Lys166) directly or via water molecules. 
The positive charge of Arg129 could be neutralized by the thiol group of Cys168 which is in the vicinity of side chains 
of Lys1 66 and Arg1 29. In this basic environment, this thiol could become a thiolate ion, Cys1 68 is invariant in Chk1 and 
is conserved in many kinases such as PKA and PhK. Our results rule out the role of sulfate ion in stabilization of the 
activation loop of Chk1 . Instead, the activation loop and the catalytic loop are stabilized by its unique secondary struc- 

25 tures and their extensive side chain interactions. 

[0152] A difference between Chk1 and other kinases is the permuted positions of Lys166 and Thr153 (Figure 2). 
Lys166 occupies the equivalent position as Glu1 82 of PhK and the phosphorylated Thr1 97 of PKA, whereas Thr153 is 
equivalent to Lys189 of PKA. The side chain of Thr153 forms a hydrogen bond with the side chain of Lys54 located in 
helix aC. Thr153 is conserved in Chk1 (Thr or Ser) and is a candidate for phosphorylation in the activation loop. The 

30 permuted position, however, makes phosphorylation of Thr153 unlikely. The activation loop is already in an active con- 
formation in Chk1 and phosphorylation would be unnecessary. Lys54 is conserved in all but S. pombe Chk1 and adja- 
cent to Glu55 which forms the invariant ion-pair with Lys38 in active kinases. The interaction between Thr153 and 
Lys54, therefore, appears to play a similar role to the interaction between His87 and the phosphate of Thr197 of PKA. 
The side chain of Lys1 66 points to Cys1 68 and its position appears to play a role in determining the substrate specificity 

35 as discussed below. In S. pombe Chk1 , the residue that corresponds with Lys1 66 is Ser, suggesting potential regulation 
of the activity of S. pombe Chk1 through phosphorylation. Concomitantly, the activation loop of S. pombe Chk1 appears 
to be more flexible since its substitutions would disrupt some of the interactions that stabilize the activation loop. 

Catalytic Residues and AMP-PNP Binding 

40 

[0153] The glycine-rich loop that anchors the phosphate groups of ATP in kinases is poorly ordered in Chk1 , as evi- 
denced by the high B factors in this region for both apoenzyme structures and AMP-PNP bound binary complex struc- 
ture. Residues 1 8-21 at the apex of the loop between (31 and (32 are flexible with poor electron density. These residues 
are highly conserved in kinases and anchor the (3-phosphate of ATP in ATP-bound kinase structures. The flexibility of 
45 this loop could play a role in regulating Chk1 kinase activity, indeed, Tyr20 present in higher organisms corresponds 
structurally to Tyr15 of Cdc2 which following phosphorylation inhibits Cdc2 activity (Coleman TR, et al., CurrOpin Cell 
Bio, 6(6):877-82 (Dec, 1994); Russo, AA et al., Nature, (1996), supra ). 

[0154] One striking feature among the active ternary complexes such as PKA and PhK is the close similarity of the 
active site residue conformation, their interacti ons with the ATP and coordination of the metal ions. The binary com- 

50 plexes that have been solved show no such conservation (Knighton DR, et al., J Mol Biol, 220(2):21 7-20 (Jul 20, 1 991 ); 
Bossemeyer, Detal., EMBOJ, 12(3): 849-59 (Mar 1993); Zheng J, etal., Protein Sci, 2(1 0):1 559-73 (Oct 1993); Owen 
DJ, etal., Structure, 3(5):467-82 (May 15, 1995); Lowe, et al., EMBOJ, (Nov 17, 1997), supra .V Many of the active site 
residues in the Chk1 structure have interactions quite similar to those in ternary complexes of Phk and PKA (Figure 4A, 
4B). In the N-terminal lobe, the invariant ion pair of active kinases is present between Lys38 and Glu55; the correspond- 

55 ing Lys in PhK and PKA interacts with a and (3 phosphates of ATP. Helix aC is firmly attached to the rest of N-terminal 
lobe through hydrophobic interactions and is in an active position relative to the rest of the N-terminal lobe. It also inter- 
acts with the DFG loop in the C-terminal lobe, the side chain of Glu55 from aC rests above Gly150. The relative side 
chain positions of Lys38, Glu55, and Asp148 are similar to those for the corresponding residues in the ternary com- 
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plexes of PKA and PhK. These residues in PKA and PhK, together with the glycine-rich loop, coordinate a Mg 2+ and 
anchor the a and (3 phosphates of ATP. In the C-terminal lobe, the conformation of the catalytic loop (residues 130-135) 
of Chk1 is nearly identical to that in PhK with the side chains of Asp130, Lys132, and Asn135 in Chk1 nearly superim- 
posable to the corresponding residues Asp149, Lys151 , and Asn 154 in PhK in which Lys151 binds to the y-phosphate 

5 of AMP-PNP and Asn154 chelates another Mg 2+ that binds to the P and y phosphates of AMP-PNR Thr1 70 is con- 
served in all serine/threonine protein kinases and appears to determine the specificity of Ser/Thr verses Tyr as phos- 
pho-acceptor. Thr170 forms hydrogen bonds with Asp130 and Lys132 analogous to Thr186 in PhK and these 
interactions are needed for the positioning the carbonyl of the catalytic residue Asp1 30. The residues of Chk1 , however, 
are far apart from those in the N-terminal lobe and the DFG loop due to the somewhat open lobe conformation (Figure 

w 6). The DFG loop is positioned higher than its counterparts in PKA and PhK. Lys38Neis 10 A away from Asp130O52, 
compared with 8.2 A in Phk and 7.8 A in PKA. Asp1 48081 is 6 A away from Asp130O52, compared with 3.8 A in PhK 
and 4.8 A in PKA. In Chk1, one water molecule is located between Asp148 and Asp130 and is hydrogen bonded to 
Asp1 30O52 as well as Asn1 35051 . The side chain of Asn1 35 is over 1 A farther away from Asp1 48 relative to the active 
conformation in PhK. The residues that are necessary for ATP phosphate binding and catalysis are clustered in two 

75 separate parts, although they maintain their local interactions. The lack of electron density of the triphosphate moiety 
of AMP-PNP in the binary complex of Chk1 probably results from misalignment of these residues as well as flexibility 
in the glycine-rich loop. 

[0155] The adenine and ribose moieties are clearly defined in our current model. As in all the structures of kinases 
with ATP, the adenine base is almost completely buried in a hydrophobic pocket between the two lobes, and hydrogen 

20 bonds are formed between N6 of adenine and the main chain carbonyl of Glu85, and between N1 and amide of Cys87. 
As in PhK, Chk1 N7 interacts with the side chain of Ser147 via a water molecule in Chk1. However, the ribose ring 
adopts a C2'-encfo conformation similar to that in the inactive form of Cdk2 (PDB ID code 1HCK, (De Bondt HL, et al., 
Nature, 363(6430) :595-602 (Jun 1 7 1 993); Schulze-Gahmen U et al., J Med Chem, 39(23):4540-6 (Nov 8, 1 996)), with 
the 02' hydrogen-bonding to Glu91, and 03' hydrogen bonding to the carbonyl of Leu 15 in the glycine-rich loop. Incom- 

25 parison, the ribose rings have C3'-endo puckering in the active ternary complexes of PKA and PhK. 

Substrate Specificity and Interactions That Stabilize the Closed Conformation 

[0156] The structured activation loop of Chk1 provided an opportunity to explore the basis of peptide substrate spe- 
30 cificity. The close resemblance of Chk1 with PhK and the available structures of PhK with and without peptide substrate 
enable us to model the interactions of peptide substrate with Chk1 . The interaction of kinases with their peptide sub- 
strates has been analyzed for three kinases, PKA with an inhibitor peptide of PKI (PDB code 1ATR (Knighton DR. J Mot 
Biol, (Jul 20, 1991), supra .). PhK with MC-peptide (PDB code 2PHK, (Lowe, et al., EMBO J, (Nov 17, 1997), supra. ), 
and insulin receptor tyrosine kinase with a peptide substrate (PDB code 1IR3, (Hubbard SR, EMBO J, 16(18):5572- 
35 81 (Sep 15, 1997)). In all three tertiary complex structures, the backbones of peptide substrates around the phosphate 
acceptor residues adopt extended conformation and interact mainly with the C-terminal lobes. 
[0157] The known Chk1 kinase substrate is the Cdc25C protein phosphatase. Several phosphate acceptor Ser res- 
idues have been identified in the Cdc25C protein sequence. Consensus features can be derived from sequences sur- 
rounding the phosphate acceptor Ser (position P): The N-terminal P-3 position is a conserved Arg, P-5 positions prefers 
40 bulky hydrophobic residues, and P-2 is Ser or Thr. Phosphorylation of Ser216 of human Cdc25C is required for DNA 
damage induced G2 arrest and Ser216 is phosphorylated by Chk1 in vitro (Peng et al., Science (1997), supra .: 
Sanchez et al., Science (1997), supra .). Therefore, the peptide LYRSPSMPE spanning residues 211-219 of human 
Cdc25C was used to model the interaction of peptide substrate with Chk1 , based on the ternary complex of PhK with 
MC-peptide. 

45 [0158] The modeled Cdc25C peptide easily fits into a groove on the C-terminal lobe of Chk1 , following a path very 
similar to that of the MC-peptide bound to PhK (Figure 7). The Oy atom of Ser(P), the presumed nucleophiie in the 
phosphate transfer reaction, is very close to an ordered water molecule in Chk1 structures. This water molecule hydro- 
gen bonds to both the Asp130O82 and Lys132Ne. Superposition of Chk1 and PhK shows that this water molecule 
would be 3.4 A from the y-phosphorus atom of the AMP-PNP in PhK. The position of this water molecule probably indi- 

so cates the approximate location of the seryl hydroxyl during catalysis. 

[0159] The hydrophobic side chain of Leu(P-5) fits into the hydrophobic pocket formed by Phe93, Ile96, Pro98, and 
Leu206. All of these residues except Leu206 are invariant in Chk1 proteins. The side chain of Arg(P-3) points towards 
Glu91 of Chk1. However, in its extended conformation, the guanidinium group of this Arg can only make a hydrogen 
bond (3 A ) with the carboxyl of Glu91 . In both PKA and PhK, the guanidinium of Arg(P-3) forms a salt bridge (2.5 A ) 

55 with the carboxyl of the corresponding Glu residues. As discussed below, ionic interaction of Arg and Glu91 could be 
established after lobe closure. 

[0160] The side chain of Ser(P-2) could make a hydrogen bond to the backbone carbonyl oxygen of Pro(P-1). In 
PhK, Gln(P-2) of the MC-peptide interacts with Ser188. This interaction is not available to Chk1 since it has an invariant 
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Prol 72 in the corresponding position of Ser1 88 in PhK. Pro1 72, then, may contribute to the specificity of Chk1 for Ser 
or Thr at P-2 position and the internal hydrogen bond provided by Ser or Thr at P-2 position may play a role in maintain- 
ing the conformation of the substrate backbone at its N-terminus. 

[0161] The hydrophobic side chain of Met(P+1) projects into a hydrophobic pocket formed by residues of Leu171, 

5 Val174, Leu178, Leu179, and Met167. The P+2 position can only accommodate a small side chain or a turn due to the 
unique position of Lys1 66. Lys1 66 is conserved among vertebrate Chk1 proteins. Correspondingly, Pro is found at the 
P+2 position of the Cdc25 substrates. Pro(P+2) creates a consensus 14-3-3 binding site once the Ser(P) is phosphor- 
ylated. The Lys166 of human Chk1 is a Ser residue in S. pombe Chk1 . The side chain of S. pombe Chk1 could be phos- 
phorylated and point to the position corresponding to the sulfate ion in human Chk1 structure. Correspondingly, bulky 

w side chains are present at the P+2 position of the substrates of S. pombe Chk1 . 

[0162] Phosphorylation of Cdc25C by Chk1 is very specific such that the Ser(P-2) is not phosphorylated. This is 
important for Cdc25C regulation since phosphorylation at the P-2 position would destroy the 14-3-3 binding site. Our 
model clearly indicates determinants for Chk1 substrate specificity: hydrophobic interaction through the P-5 and P+1, 
ionic interaction through P-3, Ser/Thr at P-2, and small amino acid side chains at the P+2 position. 

15 [0163] Although the recombinant Chk1 kinase domain is active when assayed in solution, the structure reveals that 
it is not in a closed catalytically active conformation in either the apoenzyme or the binary crystal structure. This result 
suggests that the apoenzyme and the ATP bound binary complex favor the open conformation. Lobe movement is com- 
mon in kinase domains and catalysis requires a closed conformation (Cox S, et ak, Curr Opin Struct Biol, 4(6):893- 
901 (Dec, 1994); Gangal M, etal., Biochemistry, 37(39): 13728-35 (Sep 29, 1998)). Interactions that stabilize the closed 

20 active conformation have not been addressed in detail in previous reports. Our model suggests that a key interaction in 
Chk1 is the ion-pair between Glu91 with Arg(P-3) of peptide substrate. 

[0164] Superposition of Chk1 and PhK structures indicates that lobe closure of Chk1 can be achieved by a simple 
rotation of the N-terminal lobe by -15 degree around residue Glu91. This rotation would place Glu91 closer to Arg(P- 
3) and establish an ion-pair between the carboxylate group of Glu91 and the guanidinium group of the Arg(P-3). Lobe 

25 closure could also change the ribose conformation of AMP-PNP to a C3'-endo conformation from the C2'-endo confor- 
mation in the binary complex. The catalytically active kinase ternary complex structures reported to date have their 
respective ribose rings puckered in a C3'-endo conformation. For Chk1 , when the ribose is modeled in a C3'-encfo con- 
formation, two hydrogen bonds can form between the carboxyl group of Glu91 and the 02' and 03' of the ribose. In 
comparison, the binary complex of Chk1 with AMP-PNP has only one hydrogen bond between Glu91 and the ribose. 

30 The Chk1 kinase domain in solution likely shifts dynamically ("breathes") between the open and closed conformation. 
The current Chk1 structures have open conformations and have revealed that the ATP binding cleft is accessible to 
solution. In the closed conformation, residues for phosphate binding and catalysis come together and align the phos- 
phate for transfer. The additional interaction of Glu91 with Arg(P-3) of peptide substrate and with the ribose of ATP 
would shift the equilibrium to the closed active conformation. Therefore, peptide substrates gain specificity partly 

35 through their ability to stabilize the closed catalytically active conformation of Chk1 . 

8. Regulation of Chk1 Kinase Activity 

[0165] Phosphorylation of the Chk1 substrate, Cdc25, and the resulting cell cycle arrest has been correlated with 

40 the activation of Chk1 after DNA damage. Whether phosphorylation of Chk1 regulates its kinase activity is unclear. The 
structure of human Chk1 suggests that its activity is not regulated through phosphorylation of the activation loop. 
Instead, the activation loop of Chk1 appears to be anchored by extensive interactions through rigid secondary struc- 
tures and their side chains. Interestingly, phosphorylation of the activation loop could occur in S. pombe Chk1 which has 
a Ser substitution at the position of Lys166. Whether Chk1 is regulated differently in S. pombe and mammals requires 

45 the identification of residues that are phosphorylated after DNA damage. 

[0166] The structure of the Chk1 kinase domain and its binary complex with AMP-PNP provide insight into its acti- 
vation mechanism. First, the structures reveal an unique arrangement of the residues for phosphate binding and catal- 
ysis. Specifically, the residues for a and (3 phosphate binding are separated from those for y phosphate binding and 
catalysis. Alignment of these residues is achieved in a closed conformation which is stabilized by peptide substrate. Our 

so model predicts low ATPase activity of Chk1 and favors an ordered kinetic mechanism in which ATP binding precedes 
the peptide substrate binding. Secondly, the structures exclude a role for the activation loop of human Chk1 in regulat- 
ing the kinase domain conformation. The activation loop is most likely maintained by rigid secondary structures and the 
extensive interactions of their side chains. However, a possibility of different regulatory mechanism exists for S. pombe 
Chk1 , which may reflect their different cell cycle processes and different DNA damage repair mechanisms. In addition, 

55 the interactions that stabilize the active kinase conformation have been identified. The presence of Glu in many kinase 
hinge regions and Arg at P-3 position of their substrates suggests a general role for this interaction in maintaining the 
closed conformation for Ser/Thr kinases. Interactions that determine the peptide substrate specificity suggest a consen- 
sus sequence that is useful to identify potential Chk1 substrate. Finally, Chk1 kinase domain structure provides a guide 
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for its future characterization as well as design of specific inhibitors that could abrogate checkpoint control for cancer 
therapy. 

9. Enzymatic Activity of Chk1 

5 

[0167] The enzymatic activity of a kinase is measured by its ability to catalyze the transfer of a phosphate residue 
from a nucleoside triphosphate to an amino acid side chain in a selected protein target. The conversion of ATP to ADP 
generally accompanies the catalytic reaction. Herein, a synthetic substrate peptide, Syntide-2, having amino acid 
sequence PLARTLSVAGLPGKK (SEQ ID NO. 11) was utilized. The production of ADP from ATP that accompanies 

w phosphoryl transfer to the substrate was coupled to oxidation of NADH using phosphoenolpyruvate (PEP) through the 
actions of pyruvate kinase (PK) and lactic dehydrogenase (LDH). The oxidation of NADH was monitored by following 
the decrease of absorbance at 340 nm (e340=6.22 cm-1 mM-1) using a HP8452 spectrophotometer. Typical reaction 
solutions contained: 4 mM PEP, 0.15 mM NADH, 28 units of LDH/mL, 16 units of PK/mL, 3 mM DTT, 0. 125 mM Syn- 
tide-2, 0.15 mM ATP and 25 mM MgCI 2 in 50 mM TRIS pH 7.5; 400 mM NaCI. Assays were initiated with 10 nM of 

75 kinase domain of Chk1 , KH289. Kj values were determined by measuring initial enzyme activity in the presence of var- 
ying concentrations of inhibitors. The data were analyzed using Enzyme Kinetic and Kaleidagraph software. 
[0168] The table below (Table 3) compares three different preparations of Chk1. The first preparation is the full 
length form, which comprises amino acids 1 -476 of SEQ ID NO. 2. The next preparation contains proteolytically cleaved 
fragments, a mixture of Chk1 protein fragments obtained from the full-length protein during fermentation. The exact 

20 enzymes involved and cleavage site generated for these fragments is unknown. However, analysis of the fragments 
indicated that one of them is similar in size to the 1-289. The third preparation is the kinase domain of amino acids 1- 
289 of SEQ ID NO. 2 (KH289) As mentioned above, the assay used detects the ADP product by coupling through the 
enzymatic actions of pyruvate kinase and lactate dehydrogenase. 

25 

Table 3 



Prep No. 


Prep 


Concentration 


Rate/mi n 


Activity 


Ki 


HA2-013 


Full Length Chk1 


75nM 


0.0190 


1 (control) 


48±1nM 


HA2-022 


Proteolytically cleaved Chk1 


2nM 


0.0208 


+38 fold 


37±5nM 


HB2-061 


Kinase Domain Chk1 (1-289) 


7.3nM 


0.0200 


+10 fold 


68 ±12 nM 



[0169] Additional activity comparison experiments were performed using new preparations of full length Chk1 , pro- 
35 teolitically cleaved Chk1 , and kinase domain Chk1 . The preparation conditions were as described above. Once again, 
the cleaved preparation was 38 fold more active than the non-cleaved preparation. 

10. High Throughput Screens 

40 [0170] The following substrates were tested for peptide content and activity :• 



Table 4 



Peptide Substrates 






Activity 


Peptide 


Syntide 2 


PLARTLSVAGLPGKK (SEQ ID NO. 11) 


100% 


75% 


Syntide 3 


KAGAG-PLARTLSVAGLPG-Biotin-K (SEQ ID NO. 12) 


67% 


50% 


Syntide 4 


Ac - PLARTLSVAGLPG-AGAGAGAK (SEQ ID NO. 13) 


72% 


45% 


Syntide 5 


PLARTLS (P0 3 ) VAGLPGKK (SEQ ID NO. 15) 


NT 


42% 


Syntide 6 


PLARTLS (P0 3 ) VGALPGK (Biotin) (SEQ ID NO. 16) 


NT 


77% 



55 

[0171 J As described in detail below, an aspect of the invention involves a nonradioactive ELISA based assay suita- 
ble for high throughput screening (HTS). The development of the ELISA based CHK1 kinase HTS assay was first initi- 
ated with a monoclonal anti-phosphoserine antibody called Clone PSR-45, supplied by Sigma. New Chk peptide 
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substrates, analogues of Syntide2, were synthesized to validate this assay. These peptides are listed in Table 4. Biotin- 
Syntide-2 (SEQ ID NO. 12), and N-terminus acetylated Syntide-2 (SEQ ID NO. 13) and the expected peptide products 
after CHK phosphorylation, serine phosphorylated Syntide 2 (SEQ ID NO. 15), and serine phospholylated biotin-Syn- 
tide 2 (SEQ ID NO. 16) were synthesized for assay development. Although the assay worked well in solution with these 

5 peptides, it did not work when the peptide (serine phosphorylated Syntide 2 — SEQ ID NO. 15) was immobilized on 
DNA BIND (Costar) 96 well plates. This antibody also did not work well when the biotin-labeled peptide was immobilized 
using Neutravidin coated 96 well plates (Pierce). To circumvent these issues, a polyclonal antibody specifically directed 
against phosphorylated Syntide-2 (SEQ ID NO. 15) was raised in rabbits. The rabbit polyclonal antiphosphosyntide 
antibody was found to quantitatively and specifically recognize phosphoserine on both Syntide 2-Ser-P0 3 (assay on 

10 DNA BIND plates) or on biotin-Syntide 2-Ser-P0 3 (assayed on Neutravidin coated 96 well plates) when compared with 
the unphosphorylated peptide counterparts. A modified Chk1 HTS assay ELISA was developed using His-tagged 
KH289 Chk1 kinase, biotinsyntide substrate assayed on Neutravidin coated 96 well plates, and the rabbit anti-phospho- 
syntide antibody to detect the phosphorylated product. 

[0172] This Chk1 kinase ELISA HTS allowed for the robotic screening of compound libraries. Herein, the Beckman 

15 robotics station was used. First; the Chk1 kinase was assayed in Neutravidin coated 96-welI plates in 100 uL/well of 
reaction mixture. The reaction mixture comprised 50 mM Tris-HCI (pH 7.5), 10 mM MgCI 2 , 3 mM DTT, 400 mM NaCI, 
50 uM ATP, 10 u.M biotin-Syntide 2 peptide substrate and 10 nM Chk1 kinase (KH289). The assay was performed both 
with and without 20 uM test compound. Herein, the biotin Syntide 2 substrate had the following sequence: 
PLARTLSVAGLPGK-biotin-K (SEQ ID NO. 12). 

20 [0173] The assay is depicted in Figure 10. In step A, 93 u.L of reaction mixture (less both the Chk1 kinase and the 
biotin-syntide) is added, followed by the addition of 2 uL of test compound (20 \iN\ final). The kinase reaction is initiated 
by the addition of 5 uL of enzyme -substrate stock (200 nM Chk1 kinase and 200 jliM biotin-syntide). The kinase reaction 
is allowed to proceed for 10 min at room temperature (s 22 °C) as shown in Step B. Following 10 minutes of kinase 
reaction, both phosphorylated and unphosphorylated biotin-Syntide 2 are bound to the Neutravidin coated plate. In step 

25 C, the plates are washed with PBS/Tween-20 to terminate the kinase reaction and to remove the unbound phosphor- 
ylated or non-phosphorylated biotin- Syntide 2. In step D, the plates are incubated at room temperature for 60 minutes 
with rabbit anti-phosphosyntide antibody (1 : 40,000 dilution; 1 00 uL/well). The anti-phosphosyntide antibody binds spe- 
cifically to the serine-phosphorylated biotin-Syntide 2. The unbound antibody is removed with washes of PBS/Tween- 
20. The plates are then incubated at room temperature for 60 minutes with goat-anti-rabbit- 1 gG(Fc)-HRP (horseradish 

30 peroxidase) antibody. In step E, the plates are washed with PBS/Tween to remove the unbound secondary antibody. 
Then, 100 jxUwell chromogenic dye ABTS (HRP substrate) is added. The color development, resulting from the HRP 
reaction, is allowed to take place for 18 minutes. This is followed by absorbance measurement at 405 nm in a 96-well 
plate reader. The Chk1 kinase activity is directly proportional to the optical density of the color formed. 
[0174] All references cited herein are incorporated by reference in their entirety. 

35 [0175] While the invention has been described in conjunction with examples thereof it is to be understood that the 
foregoing description is exemplary and explanatory in nature, and is intended to illustrate the invention and its preferred 
embodiments. Through routine experimentation, the artisan will recognize apparent modifications and variations that 
may be made without departing from the spirit of the invention. Thus, the invention is intended to be defined not by the 
above description, but by the following claims and their equivalents. 
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SEQUENCE LISTINGS 



[0176] 
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SEQ ID NO. 
SEQ ID NO. 
SEQ ID NO. 
SEQ ID NO. 
SEQ ID NO. 
SEQ ID NO. 
SEQ ID NO. 
SEQ ID NO. 
SEQ ID NO. 
SEQ ID NO. 
SEQ ID NO. 
SEQ ID NO. 
SEQ ID NO. 
SEQ ID NO. 



1 — full length human Chk1 (nucleotide sequence — 1933 base pairs) 

2 — full length human Chk1 (peptide sequence - 476 AA) 



50 



3 — PCR primer (chk6w) 

4 — PCR primer (KH289) 

5 — PCR primer (K289) 

6 — PCR primer (Chk11) 

7 — PCR primer (K210) 

8 — PCR primer (KH210) 

9 _ PCR primer (K248) 

10 — PCR primer (KH248) 
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1 1 — synthetic substrate peptide, Syntide-2 

1 2 — synthetic substrate peptide, Syntide-3 

13 — synthetic substrate peptide, Syntide-4 

14 — oligonucleotide primer 
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SEQ ID NO. 15 — serine phosphorylated Syntide-2 

SEQ ID NO. 16 — serine phosphoxylated biotin Syntide-2 

SEQ ID NO. 17 — peptide sequence for Cdc25 protein phosphatase 

SEQ ID NO. 18 —peptide sequence for mouse (mm) Chk1 kinase domain 

5 SEQ ID NO. 19 — peptide sequence for Xenopus (x1) Chk1 kinase domain 

SEQ ID NO. 20 — peptide sequence for fruit fly (dm) Chk1 kinase domain 
SEQ ID NO. 21 —peptide sequence for C. elegans (ce) Chk1 kinase domain 
SEQ ID NO. 22 —peptide sequence for S. cerevisiae (sc) Chk1 kinase domain 
SEQ ID NO. 23 — peptide sequence for S. pombe (sp) Chk1 kinase domain 

w SEQ ID NO. 24 —conserved motif AQXFFXQL for Chk1 kinase domain, helix _E (residues 1 07-1 14) 



75 



20 



25 



30 



35 



40 



45 



50 
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SEQUENCE LISTING 
<110> Agouron Pharmaceuticals, Inc. 

<120> Catalytic Domain of the Human Effector Cell cycle 
Checkpoint Protein Kinase, Chkl, Materials and 
Methods for Identification of Inhibitors Thereof 

<130> 30189 

<150> 60/162,887 
<151> 1999-11-01 

<150> 09/460,421 
<151> 1999-12-14 

<160> 24 

<170> Patentln Ver. 2.1 

<210> 1 

<211> 1821 

<212> DNA 

<213> Homo sapiens 



<400> 1 

ggccggacag 

ggacttggtg 

agtaactgaa 

agaaaatatt 

attctatggt 

aggagagctt 

attcttccat 

ggatattaaa 

tggcttggca 

tactttacca 

tgatgtttgg 

ccaacccagt 

cccttggaaa 

tccatcagca 

caagaaaggg 

attttctaag 

agaaaatgtg 

taccagcccc 

tcctgatcat 

ctggcagcgg 

ttatcaatgc 

gaatcaggtt 

tttgttagaa 

ggagttcaag 

gaaggtttgg 

agtgctgcta 

atagtagttc 

ttgttcggca 

gaatagaatt 

tggtggaaac 

ttttatcaaa 



tccgccgagg 
caaaccctgg 
gaagcagtcg 
aagaaagaga 
cacaggagag 
tttgacagaa 
caactcatgg 
ccagaaaatc 
acagtatttc 
tatgttgctc 
tcctgtqgaa 
gacagctgtc 
aaaatcgatt 
agaattacca 
gcaaaaaggc 
cacattcaat 
aagtactcca 
tcatacattg 
atgcttttga 
ttggtcaaaa 
ctgaaagaga 
actatatcaa 
atggatgata 
agacacttcc 
cttcctgcca 
tgttgacatt 
ctgaagtgtt 
tacaaataat 
catttga tta 
caagtt tcag 
acattttgtt 



tgctcggtgg 
gagaaggtgc 
cagtgaagat 
tctgtatcaa 
aaggcaatat 
tagagccaga 
caggggtggt 
ttctgttgga 
ggtataataa 
cagaacttct 
tagtacttac 
aggagtattc 
ctgctcctct 
ttccagacat 
cccgagtcac 
ccaatttgga 
gttctcagcc 
ataaattggt 
atagtcagtt 
gaatgacacg 
cttgtgagaa 
caactgatag 
aaatattggt 
tgaagattaa 
catgatcgga 
attcttccta 
cacttccctg 
acctatatct 
tttcttcatg 
gggacat gag 
t 



agtcatggca 
ctatggagaa 
tgtagatatg 
taaaatgcta 
ccaatattta 
cataggcatg 
ttatctgcat 
tgaaagggat 
tcgtgagcgt 
gaagagaaga 
tgcaatgctc 
tgactggaaa 
agctctgctg 
caaaaaagat 
ttcaggtggt 
cttctctcca 
agaaccccgc 
acaagggatc 
acttggcacc 
attctttacc 
gttgggctat 
gagaaacaat 
tgacttccgg 
agggaagctg 
ccatcggctc 
gagaagatta 
tttatccaaa 
taattgtaag 
tgtgtttagt 
ttttccagct 



gtgccctttg 
gttcaacttg 
aagcgtgccg 
aatcatgaaa 
tttctggagt 
cctgaaccag 
ggtattggaa 
aacctcaaaa 
ttgttgaaca 
gaatttcatg 
gctggagaat 
gaaaaaaaaa 
cataaaatct 
agatggtaca 
gtgtcagagt 
gtaaacagtg 
acaggtcttt 
agcttttccc 
ccaggatcct 
aaattggatg 
caatggaaga 
aaactcattt 
ctttctaagg 
attgatattg 
tggggaatcc 
tcctgtcctg 
catcttccaa 
caaaactttg 
atctgaattt 
tttatacaca 



tggaagactg 
ctgtgaatag 
tagactgtcc 
atgtagtaaa 
actgtagtgg 
atgctcagag 
taactcacag 
tctcagactt 
agatgtgtgg 
cagaaccagt 
tgccatggga 
catacctcaa 
tagttgagaa 
acaaacccct 
ctcccagtgg 
cttctagtga 
ccttatggga 
agcccacatg 
cacagaaccc 
cagacaaatc 
aaagttgtat 
tcaaagtgaa 
gtgatggatt 
tgagcagcca 
tggtgaatat 
caaactgcaa 
tttattttgt 
gggaaaggat 
gaaactcatc 
cgtatctcat 



60 

120 

180 

240 

300 

360 

420 

480 

540 

600 

660 

720 

7B0 

840 

900 

960 

1020 

1080 

1140 

1200 

1260 

1320 

1380 

1440 

1500 

1560 

1620 

1680 

1740 

1800 

1821 



<210> 2 

<211> 476 

<212> PRT 

<213> Homo sapiens 
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<400> 2 

Met Ala Val Pro Phe Val Glu Asp Trp Asp Leu Val Gly Thr Leu Gly 
15 10 15 

Glu Gly Ala Tyr Gly Glu Val Gin Leu Ala Val Asn Arg Val Thr Glu 
20 25 30 

Glu Ala Val Ala Val Lys lie Val Asp Met Lys Arg Ala Val Asp Cys 
35 40 45 

Pro Glu Asn lie Lys Lys Glu He Cys He Asn Lys Met Leu Asn His 
50 55 60 

Glu Asn Val Val Lys Phe Tyr Gly His Arg Arg Glu Gly Asn lie Gin 
65 70 75 80 

Tyr Leu Phe Leu Glu Tyr Cys Ser Gly Gly Glu Leu Phe Asp Arg He 
85 90 95 

Glu Pro Asp He Gly Met Pro Glu Pro Asp Ala Gin Arg Phe Phe His 
100 105 110 

Gin Leu Met Ala Gly Val Val Tyr Leu His Gly He Gly He Thr His 
115 120 125 

Arg Asp He Lys Pro Glu Asn Leu Leu Leu Asp Glu Arg Asp Asn Leu 
130 135 140 

Lys He Ser Asp Phe Gly Leu Ala Thr Val Phe Arg Tyr Asn Asn Arg 
145 150 155 160 

Glu Arg Leu Leu Asn Lys Met Cys Gly Thr Leu Pro Tyr Val Ala Pro 
165 170 175 

Glu Leu Leu Lys Arg Arg Glu Phe His Ala Glu Pro Val Asp Val Trp 
180 185 190 

Ser Cys Gly He Val Leu Thr Ala Met Leu Aia Gly Glu Leu Pro Trp 
195 200 205 

Asp Gin Pro Ser Asp Ser Cys Gin Glu Tyr Ser Asp Trp Lys Glu Lys 
210 215 220 

Lys Thr Tyr Leu Asn Pro Trp Lys Lys He Asp Ser Ala Pro Leu Ala 
225 230 235 240 

Leu Leu His Lys He Leu Val Glu Asn Pro Ser Ala Arg He Thr He 
245 250 255 

Pro Asp He Lys Lys Asp Arg Trp Tyr Asn Lys Pro Leu Lys Lys Gly 
260 265 270 

Ala Lys Arg Pro Arg Val Thr Ser Gly Gly Val Ser Glu Ser Pro Ser 
275 280 285 

Gly Phe Ser Lys His He Gin Ser Asn Leu Asp Phe Ser Pro Val Asn 
290 295 300 

Ser Ala Ser Ser Glu Glu Asn Val Lys Tyr Ser Ser Ser Gin Pro Glu 
305 310 315 320 



Pro Arg Thr Gly Leu Ser Leu Trp Asp Thr Ser Pro Ser Tyr He Asp 
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325 



330 



335 



Lys Leu Val Gin Gly He Ser Phe Ser Gin Pro Thr Cys Pro Asp His 
340 345 350 

Met Leu Leu Asn Ser Gly Leu Leu Gly Thr Pro Gly Ser Ser Gin Asn 
355 360 365 

Pro Trp Gin Arg Leu Val Lys Arg Met Thr Arg Phe Phe Thr Lys Leu 
370 375 380 

Asp Ala Asp Lys Ser Tyr Gin Cys Leu Lys Glu Thr Cys Glu Lys Leu 
385 390 395 400 

Gin Tyr Gin Trp Lys Lys Ser Cys Met Asn Gin Val Thr He Ser Thr 
405 410 415 

Thr Asp Arg Arg Asn Asn Lys Leu lie Phe Lys Val Asn Leu Leu Glu 
420 425 430 

Met Asp Asp Lys He Leu Val Asp Phe Arg Leu Ser Lys Gly Asp Gly 
435 440 445 

Leu Glu Phe Lys Arg His Phe Leu Lys He Lys Gly Lys Leu He Asp 
450 455 460 



He Val Ser Ser Gin Lys Val Trp Leu Pro Ala Thr 
465 470 475 



<210> 3 
<211> 34 
<212> DNA 

<213> Artificial Sequence 
<220> 

<223> Description of Artificial Sequence: PCR primer 
<400> 3 

gagctcagta ccatctatct tttttgatgt ctgg 



<210> A 
<211> 47 
<212> DNA 

<213> Artificial Sequence 
<220> 

<223> Description of Artificial Sequence: PCR primer 
<400> 4 

gagctcagtt ggtggtggtg gtggtgtcca ctgggagact ctgacac 



<210> 5 
<211> 28 
<212> DNA 

<213> Artificial Sequence 
<220> 

<223> Description of Artificial Sequence: PCR primer 
<400> 5 
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gagctcatcc actgggagac tctgacac 



28 



<210> 6 
<211> 29 
<212> DNA 

<213> Artificial Sequence 
<220> 

<223> Description of Artificial Sequence: PCR primer 
<400> 6 

ccatggagct caagaaaggg gcaaaaagg 29 

<210> 7 
<2ll> 28 
<212> DNA 

<213> Artificial Sequence 
<220> 

<223> Description of Artificial Sequence: PCR primer 



<210> 8 
<211> 46 
<212> DNA 

<213> Artificial Sequence 
<220> 

<223> Description of Artificial Sequence: PCR primer 
<400> 8 

gagctcagtg gtggtggtgg tggtggtggt cccatggcaa ttctcc 46 

<210> 9 
<2U> 31 
<212> DNA 

<213> Artificial Sequence 
<220> 

<223> Description of Artificial Sequence: PCR primer 



<210> 10 
<211> 49 
<212> DNA 

<213> Artificial Sequence 
<220> 

<223> Description of Artificial Sequence: PCR primer 
<400> 10 

gagctcagtg gtggtggtgg tggtgctcaa ctaagatttt atgcagcag 49 



<400> 7 

gagctcattg gtcccatggc aattctcc 



28 



<400> 9 

gagctcactc aactaagatt ttatgcagca g 
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<210> 11 
<211> 15 
<212> PRT 

<213> Artificial Sequence 
<220> 

<223> Description of Artificial Sequence: synthetic 
peptide 

<400> 11 

Pro Leu Ala Arg Thr Leu Ser Val Ala Gly Leu Pro Gly Lys Lys 
1 5 10 15 



<210> 12 
<211> 19 
<212> PRT 

<213> Artificial Sequence 
<220> 

<223> Description of Artificial Sequence: synthetic 
peptide 

<220> 

<223> Biotinylated 
<400> 12 

Lys Ala Gly Ala Gly Pro Leu Ala Arg Thr Leu Ser Val Ala Gly Leu 
15 10 15 

Pro Gly Lys 



<210> 13 
<211> 21 
<212> PRT 

<213> Artificial Sequence 
<220> 

<223> Description of Artificial Sequence: synthetic 
peptide 

<400> 13 

Pro Leu Ala Arg Thr Leu Ser Val Ala Gly Leu Pro Gly Ala Gly Ala 
1 5 10 15 

Gly Ala Gly Ala Lys 
20 



<210> 14 
<211> 30 
<212> DNA 

<213> Artificial Sequence 
<220> 

<223> Description of Artificial Sequence: 
oligonucleotide primer 

<400> 14 

tgaataatcc ggcatatgta taggtttttt 
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<210> 15 
<211> 15 
<212> PRT 

<213> Artificial Sequence 
<220> 

<223> Description of Artificial Sequence: synthetic 
peptide 

<220> 

<221> M0D_RES 
<222> (7) 

<223> phosphorylated serine 
<400> 15 

Pro Leu Ala Arg Thr Leu Ser Val Ala Gly Leu Pro Gly Lys Lys 
1 5 10 15 



<210> 16 
<211> 14 
<212> PRT 

<213> Artificial Sequence 
<220> 

<223> Description of Artificial Sequence: synthetic 
peptide 

<220> 

<221> MOD_RES 
<222> (7) 

<223> phosphorylated serine 
<220> 

<223> Biotinylated 
<400> 16 

Pro Leu Ala Arg Thr Leu Ser Val Gly Ala Leu Pro Gly Lys 
15 10 



<210> 17 

<211> 473 

<212> PRT 

<213> Homo sapiens 

<400> 17 

Met Ser Thr Glu Leu Phe Ser Ser Thr Arg Glu Glu Gly Ser Ser Gly 
1 5 10 15 

Ser Gly Pro Ser Phe Arg Ser Asn Gin Arg Lys Met Leu Asn Leu Leu 
20 25 30 

Leu Glu Arg Asp Thr Ser Phe Thr Val Cys Pro Asp Val Pro Arg Thr 
35 40 45 

Pro Val Gly Lys Phe Leu Gly Asp Ser Ala Asn Leu Ser lie Leu Ser 
50 55 60 

Gly Gly Thr Pro Lys Cys Cys Leu Asp Leu Ser Asn Leu Ser Ser Gly 
65 70 75 80 
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Glu lie Thr Ala Thr Gin Leu Thr Thr Ser Ala Asp Leu Asp Glu Thr 
85 90 95 

Gly His Leu Asp Ser Ser Gly Leu Gin Glu Val His Leu Ala Gly Met 
100 105 110 

Asn His Asp Gin His Leu Met Lys Cys Ser Pro Ala Gin Leu Leu Cys 
115 120 125 

Ser Thr Pro Asn Gly Leu Asp Arg Gly His Arg Lys Arg Asp Ala Met 
130 135 140 

Cys Ser Ser Ser Ala Asn Lys Glu Asn Asp Asn Gly Asn Leu Val Asp 
145 150 155 160 

Ser Glu Met Lys Tyr Leu Gly Ser Pro He Thr Thr Val Pro Lys Leu 
165 170 175 

Asp Lys Asn Pro Asn Leu Gly Glu Asp Gin Ala Glu Glu He Ser Asp 
180 185 190 

Glu Leu Met Glu Phe Ser Leu Lys Asp Gin Glu Ala Lys Val Ser Arg 
195 200 205 

Ser Gly Leu Tyr Arg Ser Pro Ser Met Pro Glu Asn Leu Asn Arg Pro 
210 215 220 

Arg Leu Lys Gin Val Glu Lys Phe Lys Asp Asn Thr He Pro Asp Lys 
225 230 235 240 

Val Lys Lys Lys Tyr Phe Ser Gly Gin Gly Lys Leu Arg Lys Gly Leu 
245 250 255 

Cys Leu Lys Lys Thr Val Ser Leu Cys Asp He Thr He Thr Gin Met 
260 265 270 

Leu Glu Glu Asp Ser Asn Gin Gly His Leu He Gly Asp Phe Ser Lys 
275 280 285 

Val Cys Ala Leu Pro Thr Val Ser Gly Lys His Gin Asp Leu Lys Tyr 
290 295 300 

Val Asn Pro Glu Thr Val Ala Ala Leu Leu Ser Gly Lys Phe Gin Gly 
305 310 315 320 

Leu He Glu Lys Phe Tyr Val He Asp Cys Arg Tyr Pro Tyr Glu Tyr 
325 330 " 335 

Leu Gly Gly His He Gin Gly Ala Leu Asn Leu Tyr Ser Gin Glu Glu 
340 345 350 

Leu Phe Asn Phe Phe Leu Lys Lys Pro He Val Pro Leu Asp Thr Gin 
355 360 365 

Lys Arg He He He Val Phe His Cys Glu Phe Ser Ser Glu Arg Gly 
370 375 380 

Pro Arg Met Cys Arg Cys Leu Arg Glu Glu Asp Arg Ser Leu Asn Gin 
385 390 395 " 400 



Tyr Pro Ala Leu Tyr Tyr Pro Glu Leu Tyr He Leu Lys Gly Gly Tyr 
405 410 415 
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Arg Asp Phe Phe Pro Glu Tyr Met Glu Leu Cys Glu Pro Gin Ser Tyr 
420 425 430 

Cys Pro Met His His Gin Asp His Lys Thr Glu Leu Leu Arg Cys Arg 
435 440 445 

Ser Gin Ser Lys Val Gin Glu Gly Glu Arg Gin Leu Arg Glu Gin He 
450 455 460 

Ala Leu Leu Val Lys Asp Met Ser Pro 
465 470 



<210> 18 
<211> 289 
<212> PRT 
<213> Murine sp. 

<400> 18 

Met Ala Val Pro Phe Val Glu Asp Trp Asp Leu Val Gin Thr Leu Gly 
15 10 15 

Glu Gly Ala Tyr Gly Glu Val Gin Leu Ala Val Asn Arg lie Thr Glu 
20 25 30 

Gin Ala Val Ala Val Lys He Val Asp Met Lys Arg Ala He Asp Cys 
35 40 45 

Pro Gin Asn He Lys Lys Glu He Cys He Asn Lys Met Leu Ser His 
50 55 60 

Glu Asn Val Val Lys Phe Tyr Gly His Arg Arg Glu Gly His He Gin 
65 70 75 80 

Tyr Leu Phe Leu Glu Tyr Cys Ser Gly Gly Glu Leu Phe Asp Arg He 
85 90 95 

Glu Pro Asp He Gly Met Pro Glu Gin Asp Ala Gin Arg Phe Phe His 
100 105 110 

Gin Leu Met Ala Gly Val Val Tyr Leu His Gly He Gly He Thr His 
115 120 125 

Arq Asp He Lys Pro Glu Asn Leu Leu Leu Asp Glu Arg Asp Asn Leu 
130 135 140 

Lys He Ser Asp Phe Gly Leu Ala Thr Val Phe Arg His Asn Asn Arg 
145 150 155 160 

Glu Arg Leu Leu Asn Lys Met Cys Gly Thr Leu Pro Tyr Val Ala Pro 
165 170 175 

Glu Leu Leu Lys Arg Lys Glu Phe His Ala Glu Pro Val Asp Val Trp 
180 185 190 

Ser Cys Gly He Val Leu Thr Ala Met Leu Ala Gly Glu Leu Pro Trp 
195 200 205 

Asp Gin Pro Ser Asp Ser Cys Gin Glu Tyr Ser Asp Trp Lys Glu Lys 
210 215 220 



Lys Thr Tyr Leu Asn Pro Trp Lys Lys lie Asp Ser Ala Pro Leu Ala 
225 230 235 240 
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Leu Leu His Lys lie Leu Val Glu Thr Pro Ser Ala Arg He Thr lie 
245 250 255 

Pro Asp lie Lys Lys Asp Arg Trp Tyr Asn Lys Pro Leu Asn Arq Gly 
260 265 270 

Ala Lys Arq Pro Arg Ala Thr Ser Gly Gly Met Ser Glu Ser Ser Ser 
275 280 285 

Gly 



<210> 19 
<211> 288 
<212> PRT 
<213> Xenopus sp. 

<400> 19 

Met Ala Val Pro Phe Val Glu Asp Trp Asp Leu Val Gin Thr Leu Gly 
IS 10 15 

Glu Gly Ala Tyr Gly Glu Val Gin Leu Ala Val Asn Arg Lys Thr Glu 
20 25 " 30 

Glu Ala Val Ala Val Lys lie Val Asp Met Thr Arg Ala Ala Asp Cys 
35 40 45 

Pro Glu Asn lie Lys Lys Glu He Cys He Asn Arg Met Leu Ser His 
50 55 60 

Thr Asn He Val Arg Phe Tyr Gly His Arg Arg Glu Gly Asn He Gin 
65 70 75 80 

Tyr Leu Phe Leu Glu Tyr Cys Arg Gly Gly Glu Leu Phe Asp Arg He 
85 90 95 

Glu Pro Asp Val Gly Met Pro Glu Gin Asp Ala Gin Lys Phe Phe Gin 
100 105 110 

Gin Leu He Ala Gly Val Glu Tyr Leu His Ser He Gly He Thr His 
115 120 125 

Arg Asp He Lys Pro Glu Asn Leu Leu Leu Asp Glu Arg Asp Gin Leu 
130 135 140 

Lys He Ser Asp Phe Gly Leu Ala Thr Val Phe Arg His Asn Gly Lys 
145 150 155 160 

Glu Arg Leu Leu Ser Lys Met Cys Gly Thr Leu Pro Tyr Val Ala Pro 
165 170 * 175 

Glu Leu He Lys Ser Arg Ala Phe His Ala Asp Pro Val Asp Val Trp 
180 185 190 

Ser Cys Gly He Val Leu Thr Ala Met Leu Ala Gly Glu Leu Pro Trp 
195 200 205 

Asp Gin Pro Asn Glu Val Cys Gin Glu Tyr Cys Asp Trp Lys Glu Lys 
210 215 220 

Asn His Tyr Leu Thr Pro Trp Lys Lys He Ser Ala Thr Pro Leu Ala 
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225 230 235 240 

Leu Leu Gly Lys Met Leu Thr Glu Asn Pro Gin Ser Arg lie Thr Xle 
245 250 255 

Pro Asp He Lys Lys Asp Arg Trp Phe Thr Glu He He Lys Lys Gly 
260 265 270 

Leu Lys Arg Ser Arg Val He Ser Gly Gly Ser Ser Asp Ser Ser Val 
27b 280 285 



<210> 20 
<211> 305 
<212> PRT 

<213> Drosophila sp. 
<400> 20 

Met Ala Ala Thr Leu Thr Glu Ala Gly Thr Gly Pro Ala Ala Thr Arg 
15 10 15 

Glu Phe Val Glu Gly Trp Thr Leu Ala Gin Thr Leu Gly Glu Gly Ala 
20 25 30 

Tyr Gly Glu Val Lys Leu Leu He Asn Arg Gin Thr Gly Gly Gly Cys 
35 40 45 

Gly Met Lys Met Val Asp Leu Lys Lys His Pro Asp Ala Ala Asn Ser 
50 55 60 

Val Arg Lys Glu Val Cys He Gin Lys Met Leu Gin Asp Lys His He 
65 70 75 80 

Leu Arg Phe Phe Gly Lys Arg Ser Gin Gly Ser Val Glu Tyr He Phe 
85 90 * 95 

Leu Glu Tyr Ala Ala Gly Gly Glu Leu Phe Asp Arg He Glu Pro Asp 
100 105 110 

Val Gly Met Pro Gin His Glu Ala Gin Arg Tyr Phe Thr Gin Leu Leu 
115 120 125 

Ser Gly Leu Asn Tyr Leu His Gin Arg Gly He Ala His Arg Asp Leu 
130 135 140 

Lys Pro Glu Asn Leu Leu Leu Asp Glu His Asp Asn Val Lys He Ser 
145 150 155 " 160 

Asp Phe Gly Met Ala Thr Met Phe Arg Cys Lys Gly Lys Glu Arg Leu 
165 170 * 175 

Leu Asp Lys Arg Cys Gly Thr Leu Pro Tyr Val Ala Pro Glu Val Leu 
180 185 ' 190 

Gin Lys Ala Tyr Gin Pro Gin Pro Ala Asp Leu Trp Ser Cys Gly Val 
195 200 205 

He Leu Val Thr Met Leu Ala Gly Glu Leu Pro Trp Asp Gin Pro Ser 
210 215 220 

Thr Asn Cys Thr Glu Phe Thr Asn Trp Arg Asp Asn Asp His Trp Gin 
225 230 235 " 240 
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Leu Gin Thr Pro Trp Ser Lys Leu Asp Thr Leu Aia He Ser Leu Leu 
245 250 255 

Arg Lys Leu Leu Leu Ala Thr Ser Pro Gly Thr Arg Leu Thr Leu Glu 
260 265 270 

Lys Thr Leu Asp His Lys Trp Cys Asn Met Gin Phe Ala Asp Asn Glu 
275 280 285 

Arg Ser Tyr Asp Leu Val Asp Ser Ala Ala Ala Leu Glu He Cys Ser 
290 295 300 



Pro 
305 



<210> 21 
<211> 299 
<212> PRT 
<213> C. elegans 

<400> 21 

Met Ser Ala Ala Ser Thr Thr Ser Thr Pro Ala Ala Ala Ala Val Ala 
15 10 15 

Pro Gin Gin Pro Glu Ser Leu Tyr Arg Val Val Gin Thr Leu Gly Glu 
20 25 30 

Gly Ala Phe Gly Glu Val Leu Leu He Val Asn Thr Lys Asn Pro Glu 
35 40 45 

Val Ala Ala Ala Met Lys Lys He Asn He Ala Asn Lys Ser Lys Asp 
50 55 60 

Phe He Asp Asn lie Arg Lys Glu Tyr Leu Leu Gin Lys Arg Val Ser 
65 70 75 80 

Ala Val Gly His Asp Asn Val He Arg Met He Gly Met Arg Asn Asp 
85 90 95 

Pro Gin Phe Tyr Tyr Leu Phe Leu Glu Tyr Ala Asp Gly Gly Glu Leu 
100 105 110 

Phe Asp Lys He Glu Pro Asp Cys Gly Met Ser Pro Val Phe Ala Gin 
115 120 125 

Phe Tyr Phe Lys Gin Leu He Cys Gly Leu Lys Phe He His Asp Asn 
130 135 140 

Asp Val Val His Arg Asp lie Lys Pro Glu Asn Leu Leu Leu Thr Gly 
145 150 ~ 155 160 

Thr His Val Leu Lys He Ser Asp Phe Gly Met Ala Thr Leu Tyr Arg 
165 170 175 

Asn Lys Gly Glu Glu Arg Leu Leu Asp Leu Ser Cys Gly Thr He Pro 
180 185 190 

Tyr Ala Ala Pro Glu Leu Cys Ala Gly Lys Lys Tyr Arg Gly Pro Pro 
195 " 200 205 



Val Asp Val Trp Ser Ser Gly He Val Leu He Ala Met Leu Thr Gly 
210 215 220 
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Glu Leu Pro Trp Asp Arg Ala Ser Asp Ala Ser Gin Ser Tyr Met Gly 
225 230 235 240 

Trp He Ser Asa Thr Ser Leu Asp Glu Arg Pro Trp Lys Lys He Asp 
245 250 255 

Val Arg Ala Leu Cys Met Leu Arg Lys He Val Thr Asp Lys Thr Asp 
260 265 270 

Lys Arg Ala Thr He Glu Gin He Gin Ala Asp Pro Trp Tyr Gin His 
275 280 285 

Asn Phe Gly Gin Val Glu Thr Pro Asn Gly Arg 
290 295 



<210> 22 
<211> 306 
<212> PRT 

<213> S. cerevisiae 
<400> 22 

Met Ser Leu Ser Gin Val Ser Pro Leu Pro His He Lys Asp Val Val 
1 5 10 ^ * 15 

Leu Gly Asp Thr Val Gly Gin Gly Ala Phe Ala Cys Val Lys Asn Ala 
20 25 30 

His Leu Gin Met Asp Pro Ser He He Leu Ala Val Lys Phe He His 
35 40 45 

Val Pro Thr Cys Lys Lys Met Gly Leu Ser Asp Lys Asp He Thr Lys 
50 55 60 

Glu Val Val Leu Gin Ser Lys Cys Ser Lys His Pro Asn Val Leu Arg 
65 70 " 75 80 

Leu He Asp Cys Asn Val Ser Lys Glu Tyr Met Trp He He Leu Glu 
85 90 95 

Met Ala Asp Gly Gly Asp Leu Phe Asp Lys He Glu Pro Asp Val Gly 
100 105 110 

Val Asp Ser Asp Val Ala Gin Phe Tyr Phe Gin Gin Leu Val Ser Ala 
115 120 125 

He Asn Tyr Leu His Val Glu Cys Gly Val Ala His Arg Asp He Lys 
130 135 140 

Pro Glu Asn He Leu Leu Asp Lys Asn Gly Asn Leu Lys Leu Ala Asp 
145 150 155 160 

Phe Gly Leu Ala Ser Gin Phe Arg Arg Lys Asp Gly Thr Leu Arg Val 
165 170 175 

Ser Met Asp Gin Arg Gly Ser Pro Pro Tyr Met Ala Pro Glu Val Leu 
180 185 190 

Tyr Ser Glu Glu Gly Tyr Tyr Ala Asp Arg Thr Asp He Trp Ser He 
195 200 * 205 

Gly He Leu Leu Phe Val Leu Leu Thr Gly Gin Thr Pro Trp Glu Leu 
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210 



215 



220 



Pro Ser Leu Glu Asn Glu Asp Phe Val Phe Phe Tie Glu Asn Asp Gly 
225 230 235 240 

Asn Leu Asn Trp Gly Pro Trp Ser Lys lie Glu Phe Thr His Leu Asn 
245 250 255 

Leu Leu Arg Lys lie Leu Gin Pro Asp Pro Asn Lys Arg Val Thr Leu 
260 265 ~ 270 

Lys Ala Leu Lys Leu His Pro Trp Val Leu Arg Arg Ala Ser Phe Ser 
275 280 285 

Gly Asp Asp Gly Leu Cys Asn Asp Pro Glu Leu Leu Ala Lys Lys Leu 
290 295 300 



Phe Ser 
305 



<210> 23 
<211> 295 
<212> PRT 
<213> S. pombe 

<400> 23 

Met Ala Gin Lys Leu Asp Asn Phe Pro Tyr His lie Gly Arg Glu lie 
15 10 15 

Gly Thr Gly Ala Phe Ala Ser Val Arg Leu Cys Tyr Asp Asp Asn Ala 
20 25 30 

Lys lie Tyr Ala Val Lys Phe Val Asn Lys Lys His Ala Thr Ser Cys 
35 40 45 

Met Asn Ala Gly Val Trp Ala Arg Arg Met Ala Ser Glu He Gin Leu 
50 55 60 

His Lys Leu Cys Asn Gly His Lys Asn He lie His Phe Tyr Asn Thr 
65 70 75 80 

Ala Glu Asn Pro Gin Trp Arg Trp Val Val Leu Glu Phe Ala Gin Gly 
85 90 95 

Gly Asp Leu Phe Asp Lys He Glu Pro Asp Val Gly He Asp Glu Asp 
100 105 110 

Val Ala Gin Phe Tyr Phe Ala Gin Leu Met Glu Gly He Ser Phe Met 
115 120 125 

His Ser Lys Gly Val Ala His Arg Asp Leu Lys Pro Glu Asn He Leu 
130 135 140 

Leu Asp Tyr Asn Gly Asn Leu Lys He Ser Asp Phe Gly Phe Ala Ser 
145 150 " 155 160 

Leu Phe Ser Tyr Lys Gly Lys Ser Arg Leu Leu Asn Ser Pro Val Gly 
165 170 375 

Ser Pro Pro Tyr Ala Ala Pro Glu He Thr Gin Gin Tyr Asp Gly Ser 
180 185 = 190 
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Lys Val Asp Val Trp Ser Cys Gly lie He Leu Phe Ala Leu Leu Leu 
195 200 205 

5 

Gly Asn Thr Pro Trp Asp Glu Ala He Ser Asn Thr Gly Asp Tyr Leu 
210 215 220 

Leu Tyr Lys Lys Gin Cys Glu Arg Pro Ser Tyr His Pro Trp Asn Leu 
225 230 235 240 

10 

Leu Ser Pro Gly Ala Tyr Ser He He Thr Gly Met Leu Arg Ser Asp 
245 250 255 

Pro Phe Lys Arg Tyr Ser Val Lys His Val Val Gin His Pro Trp Leu 
15 260 265 270 

Thr Ser Ser Thr Pro Phe Arg Thr Lys Asn Gly Asn Cys Ala Asp Pro 
275 280 285 

20 Val Ala Leu Ala Ser Arg Leu 

290 295 



<210> 24 

25 <211> 8 

<212> PRT 

<213> Artificial Sequence 
<220> 

<223> Description of Artificial Sequence: conserved 
30 motif 

<220> 

<221> M0D_RES 
<222> (3) 
35 <223> variable residue 

<220> 

<221> MOD_RES 
<222> (6) 
40 <223> variable residue 

<400> 24 

Ala Gin Xaa Phe Phe Xaa Gin Leu 
1 5 



Claims 

50 

1. A composition comprising an isolated, purified polynucleotide which encodes the active form of the human Chk1 
kinase or a functional, active human Chk1 kinase analog thereof. 

2. The composition according to claim 1 , wherein the nucleotide sequence of said polynucleotide comprises bases 35 
55 to 830 of SEQ ID NO. 1 or a functional, active mutant or variant thereof. 

3. A polypeptide in a crystallized form comprising the catalytically active form of the human Chk1 kinase and the inhib- 
itor binding site thereof. 
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4. The polypeptide according to claim 3 wherein the crystal is solved to a resolution of at least 2.5 O- 

5. The polypeptide according to claim 3 wherein the crystal is solved to a resolution of at least 2.0 O- 

6. The polypeptide according to claim 3 wherein the crystal is solved to a resolution of about 1 .7 O- 

7. The polypeptide according to claim 3 wherein the amino acid sequence of said polypeptide comprises amino acids 
1 6 to 265 of SEQ ID NO. 2 or an active mutant or variant thereof. 

8. The polypeptide according to claim 3 wherein the amino acid sequence of said polypeptide comprises amino acids 
1 6 to 289 of SEQ ID NO. 2 or an active mutant or variant thereof. 

9. The polypeptide according to claim 3 wherein the amino acid sequence of said polypeptide comprises amino acids 
16 to 291 of SEQ ID NO. 2 or an active mutant or variant thereof. 

10. The polypeptide according to claim 3 wherein the amino acid sequence of said polypeptide comprises amino acids 
1 to 265 of SEQ ID NO. 2 or an active mutant or variant thereof. 

11. The polypeptide according to claim 3 wherein the amino acid sequence of said polypeptide comprises amino acids 
1 to 289 of SEQ ID NO. 2 or an active mutant or variant thereof. 

12. The polypeptide according to claim 3 wherein the amino acid sequence of said polypeptide comprises amino acids 
1 to 291 of SEQ ID NO. 2 or an active mutant or variant thereof. 

13. The polypeptide according to claim 3 wherein said polypeptide further comprises a six histidine tag on the C-termi- 
nal thereof. 

1 4. An isolated, soluble, catalytically active polypeptide comprising the active form of the human Chk1 kinase or a func- 
tional, active human Chk1 kinase analog thereof. 

15. The polypeptide according to claim 14 comprising the full length human Chk1 protein having the C-terminal portion 
thereof deleted so as yield the human Chk1 kinase domain in its active configuration. 

16. The polypeptide according to claim 14 wherein said polypeptide comprises amino acids 1 6 to 265 of the sequence 
as set forth in SEQ ID NO. 2 or a conservatively substituted variant thereof. 

17. The polypeptide according to claim 14 wherein said polypeptide comprises amino acids 1 6 to 289 of the sequence 
as set forth in SEQ ID NO. 2 or a conservatively substituted variant thereof. 

18. The polypeptide according to claim 14 wherein said polypeptide comprises amino acids 1 6 to 291 of the sequence 
as set forth in SEQ ID NO. 2 or a conservatively substituted variant thereof. 

19. The polypeptide according to claim 14 wherein said polypeptide comprises amino acids 1 to 265 of the sequence 
as set forth in SEQ ID NO.2 or a conservatively substituted variant thereof. 

20. The polypeptide according to claim 14 wherein said polypeptide comprises amino acids 1 to 289 of the sequence 
as set forth in SEQ ID NO. 2 or a conservatively substituted variant thereof. 

21. The polypeptide according to claim 14 wherein said polypeptide comprises amino acids 1 to 291 of the sequence 
as set forth in SEQ ID NO. 2 or a conservatively substituted variant thereof. 

22. The polypeptide according to claim 14 wherein said polypeptide comprises amino acids 5 to 265 of the sequence 
as set forth in SEQ ID NO. 2 or a conservatively substituted variant thereof. 

23. The polypeptide according to claim 1 4 wherein said polypeptide comprises amino acids 5 to 289 of the sequence 
as set forth in SEQ ID NO. 2 or a conservatively substituted variant thereof. 

24. The polypeptide according to claim 14 wherein said polypeptide comprises amino acids 5 to 291 of the sequence 
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as set forth in SEQ ID NO. 2 or a conservatively substituted variant thereof. 

25. An expression vector for producing active human Chk1 kinase in a host cell, which vector comprises: a polynucle- 
otide encoding active form of the human Chk1 kinase or an active human Chk1 kinase analog thereof; transcrip- 

5 tional and translational regulatory sequences functional in said host cell operably linked to said human Chk1 

kinase-encoding polynucleotide; and a selectable marker. 

26. The vector according to claim 25 wherein said polynucleotide encodes the active human Chk1 kinase, said active 
kinase comprising bases 35 to 830 of SEQ ID NO. 1 . 

10 

27. The vector according to claim 25 wherein said vector is selected from the group consisting of pET28a, pAcSG2, 
and pFastBac. 

28. The vector according to claim 25 wherein said vector is pFastBac-Nde. 

15 

29. The vector according to claim 25 wherein said selectable marker is selected from the group consisting of beta 
galactosidase, green fluorescent protein, and luciferase. 

30. A host cell stably transformed and transfected with a polynucleotide encoding active form of the human Chk1 
20 kinase or an active human Chk1 kinase analog thereof in a manner allowing the expression in said host cell of the 

human Chk1 kinase. 

31. The host cell according to claim 30, wherein said polynucleotide encodes the active hChkl kinase, said active 
kinase comprising bases 35 to 830 of SEQ ID NO. 1 . 

25 

32. The host cell according to claim 30 wherein said host is E. colL 

33. The host cell according to claim 30 wherein said host is a recombinant baculovirus. 
30 34. The host cell according to claim 30 wherein said host is an insect cell. 

35. The host cell according to claim 34 wherein said insect cell is Sf9. 

36. The host cell according to claim 30 wherein said host cell is transformed and transfected with said polynucleotide 
35 via an expression vector comprising said polynucleotide; a transcriptional and translational regulatory sequences 

functional in said host cell operably linked to said hChkl kinase-encoding polynucleotide; and a selectable marker. 

37. The host cell according to claim 36 wherein said expression vector is selected from the group consisting of pET28a, 
pAcSG2, and pFastBac. 

40 

38. The host cell according to claim 36 wherein said expression vector is pFastBac-Nde. 

39. The host cell according to claim 36 wherein said selectable marker is selected from the group consisting of beta 
galactosidase, green fluorescent protein, and luciferase. 

45 

40. A method for assaying a candidate compound for its ability to interact with the human Chk1 comprising: 

(a) expressing an isolated DNA sequence or variants thereof encoding the kinase domain of said human Chk1 
in a host capable of producing said kinase in the catalytically active configuration, said kinase in a form which 

50 may be assayed for interaction of said kinase with said candidate compound; 

(b) exposing said kinase to said candidate compound; and 

(c) evaluating the interaction of said kinase with said candidate compound. 

41. A method of identifying a Chk1 kinase inhibitor by determining the binding interactions between an organic com- 
55 pound and the binding site of the Chk1 kinase in the active conformation, said binding sites being defined by the 

crystal coordinates of provided in Figure 1 1 , said method comprising: 

(a) generating the binding cavity defined by the binding site on a computer screen; 
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(b) generating compounds with their spatial structure; and 

(c) testing to see whether the compounds bind to at the Chk1 binding site; wherein those compounds that do 
bind to the Chk1 binding site can be identified as Chk1 inhibitors. 
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His-togged CHK1 Kinase domain 1-289 Purification 




a 



Q-Sepharose (150 ml) 
Flow through 



Ni-NTA Agarose (30 ml) 
Elute with 20-300 mM 
imidazole gradient 



Dialysis in 
25 mM Tris, pH7.5 
500 mM NaCI 
5 mM On 

Dilute to 
200 mM NaCI 
10 mM MaCl2 
5% glycerol 

ATP-Sepharose (40 ml) 

Elute with 

25 mM Tris, pH7.5 

500 mM NaCI 

5% glycerol 

5 mM DTT 



Cleared Cell Lysate 
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20 mM imidazole 
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EP 1 096 014 A2 



ATOM 2185 0CT1 ALA 276 26.726 24.582 8.707 1.00 66.01 

ATOM 2186 OT ALA 276 27.665 23.761 10.524 1.00 72.06 

ATOM 2187 OH2 WAT 500 7.288 0.582 30.446 1.00 12.93 

ATOM 2188 0H2 WAT 501 7.551 -2.385 30.926 1.00 14.51 

ATOM 2189 0H2 WAT 502 15.648 -3.549 26.581 1.00 12.66 

ATOM 2190 0H2 WAT 503 22.995 -4.531 32.505 1.00 14.00 

ATOM 2191 0H2 WAT 504 12.370 -2.139 29.668 1.00 12.75 

ATOM 2192 0H2 WAT 505 8.243 1.795 37.412 1.00 13.95 

ATOM 2193 0H2 WAT 506 12.211 -1.687 42.460 1.00 18.17 

ATOM 2194 0H2 WAT 507 12.547 0.038 27.856 1.00 14.35 

ATOM 2195 0H2 WAT 508 9.787 10.899 33.147 1.00 15.08 

ATOM 2196 0H2 WAT 510 11.744 7.842 36.365 1.00 15.19 

ATOM 2197 0H2WAT 511 9.925 -3.492 29.777 1.00 15.10 

ATOM 2198 0H2 WAT 512 9.590 8.537 34.696 1.00 17.43 

ATOM 2199 0H2 WAT 513 2.021 3.295 33.836 1.00 15.34 

ATOM 2200 0H2 WAT 514 6.563 13.229 27.860 1.00 18.19 

ATOM 2201 0H2 WAT 515 10.555 8.269 38.785 1.00 18.00 

ATOM 2202 0H2 WAT 516 10.674 15.405 22.497 1.00 19.56 

ATOM 2203 0H2 WAT 517 25.750 15.101 36.287 1.00 17.00 

ATOM 2204 0H2 WAT 518 4.386 6.182 34.218 1.00 15.43 

ATOM 2205 0H2 WAT 519 13.712 -1.171 31.851 1.00 19.69 

ATOM 2206 0H2 WAT 520 27.652 18.967 23.808 1.00 20.13 

ATOM 2207 0H2 WAT 521 14.113 -4.152 28.944 1.00 16.61 

ATOM 2208 0H2 WAT 522 8.101 9.135 38.813 1.00 23.68 

ATOM 2209 0H2 WAT 523 6.549 1.866 39.438 1.00 17.99 

ATOM 2210 0H2 WAT 524 8.387 10.486 30.847 1.00 15.91 

ATOM 2211 0H2 WAT 525 12.082 9.839 11.918 1.00 19.48 

ATOM 2212 0H2 WAT 526 18.804 -3.707 34.246 1.00 13.10 

ATOM 2213 0H2 WAT 527 13.250 13.468 39.304 1.00 19.10 

ATOM 2214 0H2 WAT 528 7.275 8.982 36.188 1.00 19.69 

ATOM 2215 0H2 WAT 529 5.361 7.284 36.859 1.00 17.02 

ATOM 2216 0H2 WAT 530 8.547 12.919 29.494 1.00 20.63 

ATOM 2217 0H2 WAT 531 33.657 6.673 29.562 1.00 19.62 

ATOM 2218 0H2 MAT 532 23.095 17.810 38.035 1.00 20.16 

ATOM 2219 0H2 WAT 533 7.044 4.516 40.668 1.00 18.41 

ATOM 2220 0H2 WAT 534 8.572 -2.181 21.497 1.00 19.99 

ATOM 2221 0H2 WAT 535 5.165 -3.897 30.946 1.00 16.72 

ATOM 2222 0H2 WAT 536 35.064 12.912 30;402 1.00 24.78 

.ATOM 2223 0H2 WAT 537 7.785 2.872 44.403 1.00 19.77 

ATOM 2224 0H2 WAT 538 2.503 10.234 33.144 - 1.00 23.38 

ATOM 2225 0H2 WAT 539 2.763 - 3.299 20.083 1.00 22.50 

ATOM 2226 0H2 WAT 540 6.475 6:912 39.440 1.00 22.13 
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